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Preface 


The area of adaptive control has grown to be one of the richest in terms of 
algorithms, design techniques, analytical tools, and modifications. Several 
books and research monographs already exist on the topics of parameter 
estimation and adaptive control. 

Despite this rich literature, the field of adaptive control may easily appear 
to an outsider as a collection of unrelated tricks and modifications. Students 
are often overwhelmed and sometimes confused by the vast number of what 
appear to be unrelated designs and analytical methods achieving similar re- 
sults. Researchers concentrating on different approaches in adaptive control 
often find it difficult to relate their techniques with others without additional 
research efforts. 

The purpose of this book is to alleviate some of the confusion and diffi- 
culty in understanding the design, analysis, and robustness of a wide class 
of adaptive control for continuous-time plants. The book is the outcome of 
several years of research, whose main purpose was not to generate new re- 
sults, but rather unify, simplify, and present in a tutorial manner most of the 
existing techniques for designing and analyzing adaptive control systems. 

The book is written in a self-contained fashion to be used as a textbook 
on adaptive systems at the senior undergraduate, or first and second gradu- 
ate level. It is assumed that the reader is familiar with the materials taught 
in undergraduate courses on linear systems, differential equations, and auto- 
matic control. The book is also useful for an industrial audience where the 
interest is to implement adaptive control rather than analyze its stability 
properties. Tables with descriptions of adaptive control schemes presented 
in the book are meant to serve this audience. The personal computer floppy 
disk, included with the book, provides several examples of simple adaptive 
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control systems that will help the reader understand some of the implemen- 
tation aspects of adaptive systems. 

A significant part of the book, devoted to parameter estimation and 
learning in general, provides techniques and algorithms for on-line fitting 
of dynamic or static models to data generated by real systems. The tools 
for design and analysis presented in the book are very valuable in under- 
standing and analyzing similar parameter estimation problems that appear 
in neural networks, fuzzy systems, and other universal approximators. The 
book will be of great interest to the neural and fuzzy logic audience who 
will benefit from the strong similarity that exists between adaptive systems, 
whose stability properties are well established, and neural networks, fuzzy 
logic systems where stability and convergence issues are yet to be resolved. 

The book is organized as follows: Chapter 1 is used to introduce adap- 
tive control as a method for controlling plants with parametric uncertainty. 
It also provides some background and a brief history of the development 
of adaptive control. Chapter 2 presents a review of various plant model 
representations that are useful for parameter identification and control. A 
considerable number of stability results that are useful in analyzing and un- 
derstanding the properties of adaptive and nonlinear systems in general are 
presented in Chapter 3. Chapter 4 deals with the design and analysis of on- 
line parameter estimators or adaptive laws that form the backbone of every 
adaptive control scheme presented in the chapters to follow. The design of 
parameter identifiers and adaptive observers for stable plants is presented 
in Chapter 5. Chapter 6 is devoted to the design and analysis of a wide 
class of model reference adaptive controllers for minimum phase plants. The 
design of adaptive control for plants that are not necessarily minimum phase 
is presented in Chapter 7. These schemes are based on pole placement con- 
trol strategies and are referred to as adaptive pole placement control. While 
Chapters 4 through 7 deal with plant models that are free of disturbances, 
unmodeled dynamics and noise, Chapters 8 and 9 deal with the robustness 
issues in adaptive control when plant model uncertainties, such as bounded 
disturbances and unmodeled dynamics, are present. 

The book can be used in various ways. The reader who is familiar with 
stability and linear systems may start from Chapter 4. An introductory 
course in adaptive control could be covered in Chapters 1, 2, and 4 to 9, 
by excluding the more elaborate and difficult proofs of theorems that are 
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presented either in the last section of chapters or in the appendices. Chapter 
3 could be used for reference and for covering relevant stability results that 
arise during the course. A higher-level course intended for graduate students 
that are interested in a deeper understanding of adaptive control could cover 
all chapters with more emphasis on the design and stability proofs. A course 
for an industrial audience could contain Chapters 1, 2, and 4 to 9 with 
emphasis on the design of adaptive control algorithms rather than stability 
proofs and convergence. 
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Chapter 1 


Introduction 


1.1 Control System Design Steps 

The design of a controller that can alter or modify the behavior and response 
of an unknown plant to meet certain performance requirements can be a 
tedious and challenging problem in many control applications. By plant, we 
mean any process characterized by a certain number of inputs u and outputs 
y. as shown in Figure 1.1. 

The plant inputs u are processed to produce several plant outputs y that 
represent the measured output response of the plant. The control design task 
is to choose the input u so that the output response y(t) satisfies certain given 
performance requirements. Because the plant process is usually complex, 
i.e. , it may consist of various mechanical, electronic, hydraulic parts, etc., 
the appropriate choice of u is in general not straightforward. The control 
design steps often followed by most control engineers in choosing the input 
u are shown in Figure 1.2 and are explained below. 


Inputs. 
u ~ 


Plant 

Process 

P 


> 


Outputs 

y 


Figure 1.1 Plant representation. 
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Step 1. Modeling 

The task of the control engineer in this step is to understand the pro- 
cessing mechanism of the plant, which takes a given input signal u{t ) and 
produces the output response y(t), to the point that he or she can describe 
it in the form of some mathematical equations. These equations constitute 
the mathematical model of the plant. An exact plant model should produce 
the same output response as the plant, provided the input to the model and 
initial conditions are exactly the same as those of the plant. The complexity 
of most physical plants, however, makes the development of such an exact 
model unwarranted or even impossible. But even if the exact plant model 
becomes available, its dimension is likely to be infinite, and its description 
nonlinear or time varying to the point that its usefulness from the control 
design viewpoint is minimal or none. This makes the task of modeling even 
more difficult and challenging, because the control engineer has to come up 
with a mathematical model that describes accurately the input/output be- 
havior of the plant and yet is simple enough to be used for control design 
purposes. A simple model usually leads to a simple controller that is easier 
to understand and implement, and often more reliable for practical purposes. 

A plant model may be developed by using physical laws or by processing 
the plant input /output (I/O) data obtained by performing various experi- 
ments. Such a model, however, may still be complicated enough from the 
control design viewpoint and further simplifications may be necessary. Some 
of the approaches often used to obtain a simplified model are 

(i) Linearization around operating points 

(ii) Model order reduction techniques 

In approach (i) the plant is approximated by a linear model that is valid 
around a given operating point. Different operating points may lead to 
several different linear models that are used as plant models. Linearization 
is achieved by using Taylor’s series expansion and approximation, fitting of 
experimental data to a linear model, etc. 

In approach (ii) small effects and phenomena outside the frequency range 
of interest are neglected leading to a lower order and simpler plant model. 
The reader is referred to references [67, 106] for more details on model re- 
duction techniques and approximations. 
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Figure 1.2 Control system design steps. 


In general, the task of modeling involves a good understanding of the 
plant process and performance requirements, and may require some experi- 
ence from the part of the control engineer. 

Step 2. Controller Design 

Once a model of the plant is available, one can proceed with the controller 
design. The controller is designed to meet the performance requirements for 
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the plant model. If the model is a good approximation of the plant, then 
one would hope that the controller performance for the plant model would 
be close to that achieved when the same controller is applied to the plant. 

Because the plant model is always an approximation of the plant, the 
effect of any discrepancy between the plant and the model on the perfor- 
mance of the controller will not be known until the controller is applied to 
the plant in Step 3. One, however, can take an intermediate step and ana- 
lyze the properties of the designed controller for a plant model that includes 
a class of plant model uncertainties denoted by A that are likely to appear 
in the plant. If A represents most of the unmodeled plant phenomena, its 
representation in terms of mathematical equations is not possible. Its char- 
acterization, however, in terms of some known bounds may be possible in 
many applications. By considering the existence of a general class of uncer- 
tainties A that are likely to be present in the plant, the control engineer may 
be able to modify or redesign the controller to be less sensitive to uncertain- 
ties, i.e. , to be more robust with respect to A. This robustness analysis and 
redesign improves the potential for a successful implementation in Step 3. 

Step 3. Implementation 

In this step, a controller designed in Step 2, which is shown to meet the 
performance requirements for the plant model and is robust with respect to 
possible plant model uncertainties A, is ready to be applied to the unknown 
plant. The implementation can be done using a digital computer, even 
though in some applications analog computers may be used too. Issues, 
such as the type of computer available, the type of interface devices between 
the computer and the plant, software tools, etc., need to be considered a 
priori. Computer speed and accuracy limitations may put constraints on 
the complexity of the controller that may force the control engineer to go 
back to Step 2 or even Step 1 to come up with a simpler controller without 
violating the performance requirements. 

Another important aspect of implementation is the final adjustment, 
or as often called the tuning, of the controller to improve performance by 
compensating for the plant model uncertainties that are not accounted for 
during the design process. Tuning is often done by trial and error, and 
depends very much on the experience and intuition of the control engineer. 

In this book we will concentrate on Step 2. We will be dealing with 
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the design of control algorithms for a class of plant models described by the 
linear differential equation 

x = Ax + Bu, x(0) = xo Hill 

y = C T x + Du [ ' ' ’ 

In (1.1.1) x E TV is the state of the model, u E TV the plant input, and y E 
IZ 1 the plant model output. The matrices A E 7 Z nxn , B E 7 Z nxr , C E TV* 1 , 
and D E TZ lxr could be constant or time varying. This class of plant models 
is quite general because it can serve as an approximation of nonlinear plants 
around operating points. A controller based on the linear model (1.1.1) is 
expected to be simpler and easier to understand than a controller based on 
a possibly more accurate but nonlinear plant model. 

The class of plant models given by (1.1.1) can be generalized further if we 
allow the elements of A, B, and C to be completely unknown and changing 
with time or operating conditions. The control of plant models (1.1.1) with 
A, B, C, and D unknown or partially known is covered under the area of 
adaptive systems and is the main topic of this book. 


1.2 Adaptive Control 

According to Webster’s dictionary, to adapt means “to change (oneself) so 
that one’s behavior will conform to new or changed circumstances.” The 
words “adaptive systems” and “adaptive control” have been used as early 
as 1950 [10, 27]. 

The design of autopilots for high-performance aircraft was one of the pri- 
mary motivations for active research on adaptive control in the early 1950s. 
Aircraft operate over a wide range of speeds and altitudes, and their dy- 
namics are nonlinear and conceptually time varying. For a given operating 
point, specified by the aircraft speed (Mach number) and altitude, the com- 
plex aircraft dynamics can be approximated by a linear model of the same 
form as (1.1.1). For example, for an operating point i, the linear aircraft 
model has the following form [140]: 

x = Aa + Biu, ,x(0) = x 0 n o -W 

y = Cj x + Diu 1 j 

where Ai,Bi,Ci , and Dj are functions of the operating point i. As the air- 
craft goes through different flight conditions, the operating point changes 
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Figure 1.3 Controller structure with adjustable controller gains. 

leading to different values for Ai,Bi,Ci : and D{. Because the output re- 
sponse y(t) carries information about the state x as well as the parameters, 
one may argue that in principle, a sophisticated feedback controller should 
be able to learn about parameter changes by processing y{t ) and use the 
appropriate gains to accommodate them. This argument led to a feedback 
control structure on which adaptive control is based. The controller struc- 
ture consists of a feedback loop and a controller with adjustable gains as 
shown in Figure 1.3. The way of changing the controller gains in response 
to changes in the plant and disturbance dynamics distinguishes one scheme 
from another. 


1.2.1 Robust Control 

A constant gain feedback controller may be designed to cope with parameter 
changes provided that such changes are within certain bounds. A block 
diagram of such a controller is shown in Figure 1.4 where G(s) is the transfer 
function of the plant and C(s) is the transfer function of the controller. The 
transfer function from y* to y is 


y _ C(s)G(s) 
y* 1 + C{s)G{s) 


(1.2.2) 


where C(s) is to be chosen so that the closed-loop plant is stable, despite 
parameter changes or uncertainties in G(s), and y ~ y* within the frequency 
range of interest. This latter condition can be achieved if we choose C(s) 
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Figure 1.4 Constant gain feedback controller. 

so that the loop gain \C{jw)G{jw)\ is as large as possible in the frequency 
spectrum of y* provided, of course, that large loop gain does not violate 
closed-loop stability requirements. The tracking and stability objectives can 
be achieved through the design of C(s) provided the changes within G(s) 
are within certain bounds. More details about robust control will be given 
in Chapter 8. 

Robust control is not considered to be an adaptive system even though 
it can handle certain classes of parametric and dynamic uncertainties. 

1.2.2 Gain Scheduling 

Let us consider the aircraft model (1.2.1) where for each operating point 
i, i = 1,2, . . . , N, the parameters Ai,Bi,Ci, and Di are known. For a 
given operating point i, a feedback controller with constant gains, say 0,;, 
can be designed to meet the performance requirements for the correspond- 
ing linear model. This leads to a controller, say C(0), with a set of gains 
{01, 02, ..., di, ..., On} covering N operating points. Once the operating point, 
say i, is detected the controller gains can be changed to the appropriate value 
of 9i obtained from the precomputed gain set. Transitions between different 
operating points that lead to significant parameter changes may be handled 
by interpolation or by increasing the number of operating points. The two 
elements that are essential in implementing this approach is a look-up table 
to store the values of 0* and the plant auxiliary measurements that corre- 
late well with changes in the operating points. The approach is called gain 
scheduling and is illustrated in Figure 1.5. 

The gain scheduler consists of a look-up table and the appropriate logic 
for detecting the operating point and choosing the corresponding value of 
0,: from the table. In the case of aircraft, the auxiliary measurements are 
the Mach number and the dynamic pressure. With this approach plant 
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Figure 1.5 Gain scheduling. 

parameter variations can be compensated by changing the controller gains 
as functions of the auxiliary measurements. 

The advantage of gain scheduling is that the controller gains can be 
changed as quickly as the auxiliary measurements respond to parameter 
changes. Frequent and rapid changes of the controller gains, however, may 
lead to instability [226]; therefore, there is a limit as to how often and how 
fast the controller gains can be changed. 

One of the disadvantages of gain scheduling is that the adjustment mech- 
anism of the controller gains is precomputed off-line and, therefore, provides 
no feedback to compensate for incorrect schedules. Unpredictable changes 
in the plant dynamics may lead to deterioration of performance or even to 
complete failure. Another possible drawback of gain scheduling is the high 
design and implementation costs that increase with the number of operating 
points. 

Despite its limitations, gain scheduling is a popular method for handling 
parameter variations in flight control [140, 210] and other systems [8]. 

1.2.3 Direct and Indirect Adaptive Control 

An adaptive controller is formed by combining an on-line parameter estima- 
tor, which provides estimates of unknown parameters at each instant, with 
a control law that is motivated from the known parameter case. The way 
the parameter estimator, also referred to as adaptive law in the book, is 
combined with the control law gives rise to two different approaches. In the 
first approach, referred to as indirect adaptive control, the plant parameters 
are estimated on-line and used to calculate the controller parameters. This 
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approach has also been referred to as explicit adaptive control , because the 
design is based on an explicit plant model. 

In the second approach, referred to as direct adaptive control, the plant 
model is parameterized in terms of the controller parameters that are esti- 
mated directly without intermediate calculations involving plant parameter 
estimates. This approach has also been referred to as implicit adaptive con- 
trol because the design is based on the estimation of an implicit plant model. 

In indirect adaptive control, the plant model P(9*) is parameterized with 
respect to some unknown parameter vector 9*. For example, for a linear 
time invariant (LTI) single-input single-output (SISO) plant model, 9* may 
represent the unknown coefficients of the numerator and denominator of the 
plant model transfer function. An on-line parameter estimator generates 
an estimate 9{t ) of 9* at each time t by processing the plant input u and 
output y. The parameter estimate 9(t) specifies an estimated plant model 
characterized by P(9(t )) that for control design purposes is treated as the 
“true” plant model and is used to calculate the controller parameter or gain 
vector 9 c (t) by solving a certain algebraic equation 9 c (t) = F(9(t)) at each 
time t. The form of the control law C(9 c ) and algebraic equation 9 C = F{9 ) 
is chosen to be the same as that of the control law C(9*) and equation 9* = 
F{6*) that could be used to meet the performance requirements for the plant 
model P(9*) if 9* was known. It is, therefore, clear that with this approach, 
C[6 c (t )) is designed at each time t to satisfy the performance requirements 
for the estimated plant model P(9(t)), which may be different from the 
unknown plant model P(9*). Therefore, the principal problem in indirect 
adaptive control is to choose the class of control laws C(9 c ) and the class 
of parameter estimators that generate 9(t ) as well as the algebraic equation 
9 c (t) = F(6(t )) so that C(6 c (t )) meets the performance requirements for 
the plant model P(9*) with unknown 9*. We will study this problem in 
great detail in Chapters 6 and 7, and consider the robustness properties of 
indirect adaptive control in Chapters 8 and 9. The block diagram of an 
indirect adaptive control scheme is shown in Figure 1.6. 

In direct adaptive control, the plant model P(9*) is parameterized in 
terms of the unknown controller parameter vector 9*, for which C{9*) meets 
the performance requirements, to obtain the plant model P c {9*) with exactly 
the same input/output characteristics as P(6*). 

The on-line parameter estimator is designed based on P c (9*) instead of 
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Figure 1.6 Indirect adaptive control. 


P(9*) to provide direct estimates 0 c (t) of 9* at each time t by processing the 
plant input u and output y. The estimate 9 c (t) is then used to update the 
controller parameter vector 9 C without intermediate calculations. The choice 
of the class of control laws C(9 c ) and parameter estimators generating 0 c (t) 
for which C{9 c {t )) meets the performance requirements for the plant model 
P{9*) is the fundamental problem in direct adaptive control. The properties 
of the plant model P(9*) are crucial in obtaining the parameterized plant 
model P c (9*) that is convenient for on-line estimation. As a result, direct 
adaptive control is restricted to a certain class of plant models. As we will 
show in Chapter 6, a class of plant models that is suitable for direct adaptive 
control consists of all SISO LTI plant models that are minimum-phase, i.e., 
their zeros are located in Re [s] <0. The block diagram of direct adaptive 
control is shown in Figure 1.7. 

The principle behind the design of direct and indirect adaptive control 
shown in Figures 1.6 and 1.7 is conceptually simple. The design of C(9 c ) 
treats the estimates 9 c {t ) (in the case of direct adaptive control) or the 
estimates 9(t) (in the case of indirect adaptive control) as if they were the 
true parameters. This design approach is called certainty equivalence and can 
be used to generate a wide class of adaptive control schemes by combining 
different on-line parameter estimators with different control laws. 
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Figure 1.7 Direct adaptive control. 

The idea behind the certainty equivalence approach is that as the param- 
eter estimates 0 c (t) and 9(t) converge to the true ones 9* and 9*, respectively, 
the performance of the adaptive controller C(9 c ) tends to that achieved by 
C{9 *) in the case of known parameters. 

The distinction between direct and indirect adaptive control may be con- 
fusing to most readers for the following reasons: The direct adaptive control 
structure shown in Figure 1.7 can be made identical to that of the indi- 
rect adaptive control by including a block for calculations with an identity 
transformation between updated parameters and controller parameters. In 
general, for a given plant model the distinction between the direct and in- 
direct approach becomes clear if we go into the details of design and anal- 
ysis. For example, direct adaptive control can be shown to meet the per- 
formance requirements, which involve stability and asymptotic tracking, for 
a minimum-phase plant. It is still not clear how to design direct schemes 
for nonminimum-phase plants. The difficulty arises from the fact that, in 
general, a convenient (for the purpose of estimation) parameterization of the 
plant model in terms of the desired controller parameters is not possible for 
nonminimum-phase plant models. 

Indirect adaptive control, on the other hand, is applicable to both 
minimum- and nonminimum-phase plants. In general, however, the mapping 
between 9(t) and 9 c (t), defined by the algebraic equation 9 c (t) = F(9(t)), 
cannot be guaranteed to exist at each time t giving rise to the so-called 
stabilizability problem that is discussed in Chapter 7. As we will show in 
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Figure 1.8 Model reference control. 

Chapter 7, solutions to the stabilizability problem are possible at the expense 
of additional complexity. 

Efforts to relax the minimum-phase assumption in direct adaptive control 
and resolve the stabilizability problem in indirect adaptive control led to 
adaptive control schemes where both the controller and plant parameters 
are estimated on-line, leading to combined direct/indirect schemes that are 
usually more complex [112]. 

1.2.4 Model Reference Adaptive Control 

Model reference adaptive control (MRAC) is derived from the model follow- 
ing problem or model reference control (MRC) problem. In MRC, a good 
understanding of the plant and the performance requirements it has to meet 
allow the designer to come up with a model, referred to as the reference 
model , that describes the desired I/O properties of the closed-loop plant. 
The objective of MRC is to find the feedback control law that changes the 
structure and dynamics of the plant so that its I/O properties are exactly 
the same as those of the reference model. The structure of an MRC scheme 
for a LTI, SISO plant is shown in Figure 1.8. The transfer function W m (s) of 
the reference model is designed so that for a given reference input signal r(t ) 
the output Dm{t) of the reference model represents the desired response the 
plant output y(t) should follow. The feedback controller denoted by C(9*) 
is designed so that all signals are bounded and the closed-loop plant transfer 
function from r to y is equal to W m (s). This transfer function matching 
guarantees that for any given reference input r(f), the tracking error 






1.2. ADAPTIVE CONTROL 


13 



Reference Model 


W m { S ) 


Vm 




Input 

Commandl 
r 


^ Controller 

u 

► C(9 c ) 




Plant 

P{9*) 


On- 

Par ar 

Estimati 

Line 

◄ 

neter 

ion of 9* 


m 

Calculations 
m = F(9(t)) 



+ 


Figure 1.9 Indirect MRAC. 


ei — y — y m , which represents the deviation of the plant output from the 
desired trajectory y m , converges to zero with time. The transfer function 
matching is achieved by canceling the zeros of the plant transfer function 
G(s) and replacing them with those of W m (s) through the use of the feedback 
controller C(9*). The cancellation of the plant zeros puts a restriction on 
the plant to be minimum phase, i.e. , have stable zeros. If any plant zero is 
unstable, its cancellation may easily lead to unbounded signals. 

The design of C(9*) requires the knowledge of the coefficients of the plant 
transfer function G(s). If 0* is a vector containing all the coefficients of 
G(s) = G(s, 9*), then the parameter vector 9* may be computed by solving 
an algebraic equation of the form 


9* = F{9*) (1.2.3) 

It is, therefore, clear that for the MRC objective to be achieved the plant 
model has to be minimum phase and its parameter vector 9* has to be known 
exactly. 
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Figure 1.10 Direct MRAC. 

When 9* is unknown the MRC scheme of Figure 1.8 cannot be imple- 
mented because 6* cannot be calculated using (1.2.3) and is, therefore, un- 
known. One way of dealing with the unknown parameter case is to use the 
certainty equivalence approach to replace the unknown 9* in the control law 
with its estimate 9 c (t) obtained using the direct or the indirect approach. 
The resulting control schemes are known as MRAC and can be classified as 
indirect MRAC shown in Figure 1.9 and direct MRAC shown in Figure 1.10. 

Different choices of on-line parameter estimators lead to further classifi- 
cations of MRAC. These classifications and the stability properties of both 
direct and indirect MRAC will be studied in detail in Chapter 6. 

Other approaches similar to the certainty equivalence approach may be 
used to design direct and indirect MRAC schemes. The structure of these 
schemes is a modification of those in Figures 1.9 and 1.10 and will be studied 
in Chapter 6. 

1.2.5 Adaptive Pole Placement Control 

Adaptive pole placement control (APPC) is derived from the pole placement 
control (PPC) and regulation problems used in the case of LTI plants with 
known parameters. 
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Figure 1.11 Pole placement control. 

In PPC, the performance requirements are translated into desired loca- 
tions of the poles of the closed- loop plant. A feedback control law is then 
developed that places the poles of the closed-loop plant at the desired loca- 
tions. A typical structure of a PPC scheme for a LTI, SISO plant is shown 
in Figure 1.11. 

The structure of the controller C(9*) and the parameter vector 9* are 
chosen so that the poles of the closed-loop plant transfer function from r to 
y are equal to the desired ones. The vector 9* is usually calculated using an 
algebraic equation of the form 


9* = F(9*) (1.2.4) 

where 9* is a vector with the coefficients of the plant transfer function G(s). 

If 9* is known, then 9* is calculated from (1.2.4) and used in the control 
law. When 9* is unknown, 9* is also unknown, and the PPC scheme of 
Figure 1.11 cannot be implemented. As in the case of MRC, we can deal with 
the unknown parameter case by using the certainty equivalence approach to 
replace the unknown vector 9* with its estimate 9 c (t). The resulting scheme 
is referred to as adaptive pole placement control (APPC). If 9 c (t) is updated 
directly using an on-line parameter estimator, the scheme is referred to as 
direct APPC. If 9 c {t) is calculated using the equation 

9 c (t) = F(9(t)) (1.2.5) 

where 9(t ) is the estimate of 9* generated by an on-line estimator, the scheme 
is referred to as indirect APPC. The structure of direct and indirect APPC 
is the same as that shown in Figures 1.6 and 1.7 respectively for the general 
case. 

The design of APPC schemes is very flexible with respect to the choice 
of the form of the controller C(9 c ) and of the on-line parameter estimator. 
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For example, the control law may be based on the linear quadratic design 
technique, frequency domain design techniques, or any other PPC method 
used in the known parameter case. Various combinations of on-line estima- 
tors and control laws lead to a wide class of APPC schemes that are studied 
in detail in Chapter 7. 

APPC schemes are often referred to as self-tuning regulators in the liter- 
ature of adaptive control and are distinguished from MRAC. The distinction 
between APPC and MRAC is more historical than conceptual because as 
we will show in Chapter 7, MRAC can be considered as a special class of 
APPC. MRAC was first developed for continuous-time plants for model fol- 
lowing, whereas APPC was initially developed for discrete-time plants in a 
stochastic environment using minimization techniques. 

1.2.6 Design of On-Line Parameter Estimators 

As we mentioned in the previous sections, an adaptive controller may be con- 
sidered as a combination of an on-line parameter estimator with a control 
law that is derived from the known parameter case. The way this combina- 
tion occurs and the type of estimator and control law used gives rise to a 
wide class of different adaptive controllers with different properties. In the 
literature of adaptive control the on-line parameter estimator has often been 
referred to as the adaptive law, update law, or adjustment mechanism. In this 
book we will often refer to it as the adaptive law. The design of the adaptive 
law is crucial for the stability properties of the adaptive controller. As we 
will see in this book the adaptive law introduces a multiplicative nonlinearity 
that makes the closed-loop plant nonlinear and often time varying. Because 
of this, the analysis and understanding of the stability and robustness of 
adaptive control schemes are more challenging. 

Some of the basic methods used to design adaptive laws are 

(i) Sensitivity methods 

(ii) Positivity and Lyapunov design 

(iii) Gradient method and least-squares methods based on estimation error 
cost criteria 

The last three methods are used in Chapters 4 and 8 to design a wide class 
of adaptive laws. The sensitivity method is one of the oldest methods used 
in the design of adaptive laws and will be briefly explained in this section 
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together with the other three methods for the sake of completeness. It will 
not be used elsewhere in this book for the simple reason that in theory the 
adaptive laws based on the last three methods can be shown to have better 
stability properties than those based on the sensitivity method. 

(i) Sensitivity methods 

This method became very popular in the 1960s [34, 104], and it is still 
used in many industrial applications for controlling plants with uncertainties. 
In adaptive control, the sensitivity method is used to design the adaptive 
law so that the estimated parameters are adjusted in a direction that min- 
imizes a certain performance function. The adaptive law is driven by the 
partial derivative of the performance function with respect to the estimated 
parameters multiplied by an error signal that characterizes the mismatch 
between the actual and desired behavior. This derivative is called sensitivity 
function and if it can be generated on-line then the adaptive law is irnple- 
mentable. In most formulations of adaptive control, the sensitivity function 
cannot be generated on-line, and this constitutes one of the main drawbacks 
of the method. The use of approximate sensitivity functions that are irn- 
plementable leads to adaptive control schemes whose stability properties are 
either weak or cannot be established. 

As an example let us consider the design of an adaptive law for updating 
the controller parameter vector 9 C of the direct MRAC scheme of Figure 1.10. 

The tracking error e± represents the deviation of the plant output y from 
that of the reference model, i.e., e\ = y — y m . Because 9 C = 9* implies that 
e\ = 0 at steady state, a nonzero value of e\ may be taken to imply that 
9 c f= 9*. Because y depends on 9 C , i.e., y = y(9 c ) we have e\ = e\ (9 C ) and, 
therefore, one way of reducing e\ to zero is to adjust 9 C in a direction that 
minimizes a certain cost function of ei. A simple cost function for e\ is the 
quadratic function 

J(0c) = ^ (1.2.6) 

A simple method for adjusting 9 C to minimize J[9 C ) is the method of 
steepest descent or gradient method (see Appendix B) that gives us the 
adaptive law 


9 C = - 7 VJ(0 c ) = - 7 eiVei(0 c ) 


(1.2.7) 
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where 


Vei(0 c ) = 


dei de\ 


de\ 


do r i ’ do r T'"' do, 


T 


is the gradient of e\ with respect to 


(1.2.8) 


Oc = [Ocl, 8c2, ■■■, 0 cn ] T 


Because 

Vei (0 c ) = \7y(0 c ) 

we have 

Oc = -'yeiVy(O c ) (1.2.9) 

where 7 > 0 is an arbitrary design constant referred to as the adaptive gain 
and i = 1 , 2 ,...,n are the sensitivity functions of y with respect to the 
elements of the controller parameter vector 0 C . The sensitivity functions 
represent the sensitivity of the plant output to changes in the controller 
parameter 0 C . 

In (1.2.7) the parameter vector 0 C is adjusted in the direction of steepest 
descent that decreases J(0 C ) = 61 ^ . If J(0 C ) is a convex function, then it 
has a global minimum that satisfies Vy(# c ) = 0, i.e. , at the minimum 0 C = 0 
and adaptation stops. 

The implementation of (1.2.9) requires the on-line generation of the sen- 
sitivity functions Vy that usually depend on the unknown plant parameters 
and are, therefore, unavailable. In these cases, approximate values of the 
sensitivity functions are used instead of the actual ones. One type of ap- 
proximation is to use some a priori knowledge about the plant parameters 
to compute the sensitivity functions. 

A popular method for computing the approximate sensitivity functions 
is the so-called MIT rule. With this rule the unknown parameters that are 
needed to generate the sensitivity functions are replaced by their on-line esti- 
mates. Unfortunately, with the use of approximate sensitivity functions, it is 
not possible, in general, to prove global closed-loop stability and convergence 
of the tracking error to zero. In simulations, however, it was observed that 
the MIT rule and other approximation techniques performed well when the 
adaptive gain 7 and the magnitude of the reference input signal are small. 
Averaging techniques are used in [135] to confirm these observations and es- 
tablish local stability for a certain class of reference input signals. Globally, 
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however, the schemes based on the MIT rule and other approximations may 
go unstable. Examples of instability are presented in [93, 187, 202], 

We illustrate the use of the MIT rule for the design of an MRAC scheme 
for the plant 

y=-aiy-a 2 y + u (1.2.10) 

where a\ and 02 are the unknown plant parameters, and y and y are available 
for measurement. 

The reference model to be matched by the closed loop plant is given by 


?/m — y m + 7” (1.2.11) 

The control law 

u = 9ly + 6* 2 y + r (1.2.12) 

where 

01 = 01-2,05 = 02-1 (1.2.13) 

will achieve perfect model following. The equation (1.2.13) is referred to as 
the matching equation. Because a\ and are unknown, the desired values 
of the controller parameters 0\ and 0\ cannot be calculated from (1.2.13). 
Therefore, instead of (1.2.12) we use the control law 

u = 0iy + 0 2 y + r (1.2.14) 


where 9\ and 02 are adjusted using the MIT rule as 


i, dy a dy 

e ' = -~ iei W 1 ’ #2 = - 7ei aS 


(1.2.15) 


where e± = y—y m - To implement (1.2.15), we need to generate the sensitivity 
functions J^-, on-line. 

Using (1.2.10) and (1.2.14) we obtain 


dy 

d0 x 

dy 

802 


dy dy dy 

ai d0i a ' 2 d0 1 +y + 9l d0i 


dy dy dy 

ai d0 2 a ' 2 d0 2 + V + 6l d0 2 


, n dy 

+ 02 w 1 


dy 

+ 02 W 2 


(1.2.16) 

(1.2.17) 
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If we now assume that the rate of adaptation is slow, i.e., Q\ and 0 2 are 
small, and the changes of ij and y with respect to 9\ and 62 are also small, 
we can interchange the order of differentiation to obtain 


d 2 dy 

dUWi 


{9i - ai) 


d dy 
dt d9\ 


+ ( 0 2 - a 2 ) 


9y 

dOi 


+ y 


(1.2.18) 


d^dy_ _ d dy 

dt 2 d9 2 1 Ql dt d9 2 

which we may rewrite as 




+ y 


dy = 1 . 

dOi p 2 - ( 0i - ai)p- (0 2 - a 2 ) V 


(1.2.19) 


(1.2.20) 


dy = 1 

d0 2 p 2 - ( 0i - ai)p - (0 2 - a 2 ) V 


(1.2.21) 


where p(-) = is the differential operator. 

Because a\ and a 2 are unknown, the above sensitivity functions cannot 
be used. Using the MIT rule, we replace a\ and a 2 with their estimates d\ 
and a 2 in the matching equation (1.2.13), i.e., we relate the estimates a\ and 
a 2 with 0\ and O2 using 


ai — 0i + 2, a 2 — 02 + 1 


(1.2.22) 


and obtain the approximate sensitivity functions 

dy 1 . dy 1 

d0\ p 2 + 2 p+l^' <90 2 p 2 + 2 p + l 


(1.2.23) 


The equations given by (1.2.23) are known as the sensitivity filters or mod- 
els. , and can be easily implemented to generate the approximate sensitivity 
functions for the adaptive law (1.2.15). 

As shown in [93, 135], the MRAC scheme based on the MIT rule is 
locally stable provided the adaptive gain 7 is small, the reference input 
signal has a small amplitude and sufficient number of frequencies, and the 
initial conditions 0i(O) and 0 2 (O) are close to 0J and 0\ respectively. 

For larger 7 and 0i(O) and 0 2 (O) away from 0\ and O^- the MIT rule may 
lead to instability and unbounded signal response. 
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The lack of stability of MIT rule based adaptive control schemes promp- 
ted several researchers to look for different methods of designing adaptive 
laws. These methods include the positivity and Lyapunov design approach, 
and the gradient and least-squares methods that are based on the minimiza- 
tion of certain estimation error criteria. These methods are studied in detail 
in Chapters 4 and 8, and are briefly described below. 

(ii) Positivity and Lyapunov design 

This method of developing adaptive laws is based on the direct method 
of Lyapunov and its relationship with positive real functions. In this ap- 
proach, the problem of designing an adaptive law is formulated as a sta- 
bility problem where the differential equation of the adaptive law is chosen 
so that certain stability conditions based on Lyapunov theory are satisfied. 
The adaptive law developed is very similar to that based on the sensitivity 
method. The only difference is that the sensitivity functions in the approach 
(i) are replaced with ones that can be generated on-line. In addition, the 
Lyapunov-based adaptive control schemes have none of the drawbacks of the 
MIT rule-based schemes. 

The design of adaptive laws using Lyapunov’s direct method was sug- 
gested by Grayson [76], Parks [187], and Shackcloth and Butchart [202] in 
the early 1960s. The method was subsequently advanced and generalized to 
a wider class of plants by Phillipson [188], Monopoli [149], Narendra [172], 
and others. 

A significant part of Chapters 4 and 8 will be devoted to developing 
adaptive laws using the Lyapunov design approach. 

(iii) Gradient and least-squares methods based on estimation 
error cost criteria 

The main drawback of the sensitivity methods used in the 1960s is that 
the minimization of the performance cost function led to sensitivity functions 
that are not implement able. One way to avoid this drawback is to choose a 
cost function criterion that leads to sensitivity functions that are available for 
measurement. A class of such cost criteria is based on an error referred to as 
the estimation error that provides a measure of the discrepancy between the 
estimated and actual parameters. The relationship of the estimation error 
with the estimated parameters is chosen so that the cost function is convex, 
and its gradient with respect to the estimated parameters is implement able. 
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Several different cost criteria may be used, and methods, such as the gradient 
and least-squares, may be adopted to generate the appropriate sensitivity 
functions. 

As an example, let us design the adaptive law for the direct MRAC law 
(1.2.14) for the plant (1.2.10). 

We first rewrite the plant equation in terms of the desired controller 
parameters given by (1.2.13), i.e., we substitute for a\ = 2 + 8 \, a >2 = 1 + 8 % 
in (1.2.10) to obtain 


y = -2 y - y - 0 \y - 0 * 2 y + u 


which may be rewritten as 


(1.2.24) 


y = Qlyf + Qlyf + uf 


(1.2.25) 


where 


Vf = 


1 

s2~+ 27+1^’ 


Vf = 


1 

s 2 + 2 s+l V ' 


Uf 


1 

s 2 + 2 s + 1 U 


(1.2.26) 


are signals that can be generated by filtering. 

If we now replace 6 \ and 6 \ with their estimates 6 \ and 82 in equation 
(1.2.25), we will obtain, 


y = Oijjf + 02Vf + Uf (1.2.27) 

where y is the estimate of y based on the estimate 8 \ and 82 of 8 \ and 02 - 
The error 

£i = y-y = y- yf - 0 2 yf - u f ( 1 . 2 . 28 ) 

is, therefore, a measure of the discrepancy between 8 1 , 82 and 81,8%, respec- 
tively. We refer to it as the estimation error. The estimates 8 \ and 82 can 
now be adjusted in a direction that minimizes a certain cost criterion that 
involves £\. A simple such criterion is 

j{ 8 i, 8 2 ) = Y = \(y~ 0 iyi ~ e ^yf - u s ) 2 ( 1 . 2 . 29 ) 

which is to be minimized with respect to 8 82 - It is clear that J{ 8 \ , 82 ) is a 
convex function of 8 \, 82 and, therefore, the minimum is given by VJ = 0. 
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If we now use the gradient method to minimize J{ 6 1, #2), we obtain the 
adaptive laws 


Oi = -71 -QQ- = 7i^i Vf, 0-2 = -72 = l 2 £iVf (1.2.30) 

where 71,72 > 0 are the adaptive gains and E\ , y/,y/ are all implementable 
signals. 

Instead of (1.2.29), one may use a different cost criterion for E\ and a 
different minimization method leading to a wide class of adaptive laws. In 
Chapters 4 to 9 we will examine the stability properties of a wide class 
of adaptive control schemes that are based on the use of estimation error 
criteria, and gradient and least-squares type of optimization techniques. 


1.3 A Brief History 

Research in adaptive control has a long history of intense activities that 
involved debates about the precise definition of adaptive control, examples 
of instabilities, stability and robustness proofs, and applications. 

Starting in the early 1950s, the design of autopilots for high-performance 
aircraft motivated an intense research activity in adaptive control. High- 
performance aircraft undergo drastic changes in their dynamics when they fly 
from one operating point to another that cannot be handled by constant-gain 
feedback control. A sophisticated controller, such as an adaptive controller, 
that could learn and accommodate changes in the aircraft dynamics was 
needed. Model reference adaptive control was suggested by Whitaker et 
al. in [184, 235] to solve the autopilot control problem. The sensitivity 
method and the MIT rule was used to design the adaptive laws of the various 
proposed adaptive control schemes. An adaptive pole placement scheme 
based on the optimal linear quadratic problem was suggested by Kalman in 
[96]. 

The work on adaptive flight control was characterized by “a lot of en- 
thusiasm, bad hardware and non-existing theory” [11]. The lack of stability 
proofs and the lack of understanding of the properties of the proposed adap- 
tive control schemes coupled with a disaster in a flight test [219] caused the 
interest in adaptive control to diminish. 
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The 1960s became the most important period for the development of 
control theory and adaptive control in particular. State space techniques 
and stability theory based on Lyapunov were introduced. Developments 
in dynamic programming [19, 20], dual control [53] and stochastic control 
in general, and in system identification and parameter estimation [13, 229] 
played a crucial role in the reformulation and redesign of adaptive control. 
By 1966 Parks and others found a way of redesigning the MIT rule-based 
adaptive laws used in the MRAC schemes of the 1950s by applying the 
Lyapunov design approach. Their work, even though applicable to a special 
class of LTI plants, set the stage for further rigorous stability proofs in 
adaptive control for more general classes of plant models. 

The advances in stability theory and the progress in control theory in 
the 1960s improved the understanding of adaptive control and contributed 
to a strong renewed interest in the field in the 1970s. On the other hand, 
the simultaneous development and progress in computers and electronics 
that made the implementation of complex controllers, such as the adaptive 
ones, feasible contributed to an increased interest in applications of adaptive 
control. The 1970s witnessed several breakthrough results in the design 
of adaptive control. MRAC schemes using the Lyapunov design approach 
were designed and analyzed in [48, 153, 174], The concepts of positivity 
and hyperstability were used in [123] to develop a wide class of MRAC 
schemes with well-established stability properties. At the same time parallel 
efforts for discrete-time plants in a deterministic and stochastic environment 
produced several classes of adaptive control schemes with rigorous stability 
proofs [72, 73]. The excitement of the 1970s and the development of a wide 
class of adaptive control schemes with well established stability properties 
was accompanied by several successful applications [80, 176, 230]. 

The successes of the 1970s, however, were soon followed by controversies 
over the practicality of adaptive control. As early as 1979 it was pointed 
out that the adaptive schemes of the 1970s could easily go unstable in the 
presence of small disturbances [48]. The nonrobust behavior of adaptive 
control became very controversial in the early 1980s when more examples of 
instabilities were published demonstrating lack of robustness in the presence 
of unmodeled dynamics or bounded disturbances [85, 197]. This stimulated 
many researchers, whose objective was to understand the mechanisms of 
instabilities and find ways to counteract them. By the mid 1980s, several 
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new redesigns and modifications were proposed and analyzed, leading to a 
body of work known as robust adaptive control. An adaptive controller is 
defined to be robust if it guarantees signal boundedness in the presence of 
“reasonable” classes of unmodeled dynamics and bounded disturbances as 
well as performance error bounds that are of the order of the modeling error. 

The work on robust adaptive control continued throughout the 1980s 
and involved the understanding of the various robustness modifications and 
their unification under a more general framework [48, 87, 84] . 

The solution of the robustness problem in adaptive control led to the 
solution of the long-standing problem of controlling a linear plant whose 
parameters are unknown and changing with time. By the end of the 1980s 
several breakthrough results were published in the area of adaptive control 
for linear time- varying plants [226]. 

The focus of adaptive control research in the late 1980s to early 1990s 
was on performance properties and on extending the results of the 1980s to 
certain classes of nonlinear plants with unknown parameters. These efforts 
led to new classes of adaptive schemes, motivated from nonlinear system 
theory [98, 99] as well as to adaptive control schemes with improved transient 
and steady-state performance [39, 211]. 

Adaptive control has a rich literature full with different techniques for 
design, analysis, performance, and applications. Several survey papers [56, 
183], and books and monographs [3, 15, 23, 29, 48, 55, 61, 73, 77, 80, 85, 
94, 105, 123, 144, 169, 172, 201, 226, 229, 230] have already been published. 
Despite the vast literature on the subject, there is still a general feeling that 
adaptive control is a collection of unrelated technical tools and tricks. The 
purpose of this book is to unify the various approaches and explain them in 
a systematic and tutorial manner. 



Chapter 2 


Models for Dynamic Systems 


2.1 Introduction 

In this chapter, we give a brief account of various models and parameteriza- 
tions of LTI systems. Emphasis is on those ideas that are useful in studying 
the parameter identification and adaptive control problems considered in 
subsequent chapters. 

We begin by giving a summary of some canonical state space models for 
LTI systems and of their characteristics. Next we study I/O descriptions 
for the same class of systems by using transfer functions and differential 
operators. We express transfer functions as ratios of two polynomials and 
present some of the basic properties of polynomials that are useful for control 
design and system modeling. 

systems that we express in a form in which parameters, such as coeffi- 
cients of polynomials in the transfer function description, are separated from 
signals formed by filtering the system inputs and outputs. These paramet- 
ric models and their properties are crucial in parameter identification and 
adaptive control problems to be studied in subsequent chapters. 

The intention of this chapter is not to give a complete picture of all 
aspects of LTI system modeling and representation, but rather to present a 
summary of those ideas that are used in subsequent chapters. For further 
discussion on the topic of modeling and properties of linear systems, we 
refer the reader to several standard books on the subject starting with the 
elementary ones [25, 41, 44, 57, 121, 180] and moving to the more advanced 
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ones [30, 42, 95, 198, 237, 238]. 

2.2 State-Space Models 

2.2.1 General Description 

Many systems are described by a set of differential equations of the form 

x(t) = f(x(t),u(t),t), x(t 0 )=x o 

y(t) = g(x(t),u(t),t) (2.2.1) 

where 

t is the time variable 

x(t) is an n-dimensional vector with real elements that denotes the state 
of the system 

u(t) is an r-dimensional vector with real elements that denotes the input 
variable or control input of the system 
y(t ) is an /-dimensional vector with real elements that denotes the output 
variables that can be measured 
/, g are real vector valued functions 

n is the dimension of the state x called the order of the system 
x(to) denotes the value of x(t) at the initial time t = to >0 

When /, g are linear functions of x,u, (2.2.1) takes the form 

x = A(t)x + B(t)u, x(to) = xq 
y = C T (t)x + D{t)u (2.2.2) 

where A(t) € K nxn ,B(t) G K nxr ,C{t) E K nxl , and D(t) G K lxr are ma- 
trices with time-varying elements. If in addition to being linear, /, g do not 
depend on time t, we have 

x = Ax + Bu, x(to) = xq 
y = C T x + Du (2.2.3) 

where A,B,C, and D are matrices of the same dimension as in (2.2.2) but 
with constant elements. 
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We refer to (2.2.2) as the finite-dimensional linear time-varying (LTV) 
system and to (2.2.3) as the finite dimensional LTI system. 

The solution x(t),y(t) of (2.2.2) is given by 

x(t) = $(t,to)x(to) + / $(t,r)B(r)u(r)dr 

Jto 

y(t ) = C T (t)x(t) + D(t)u(t) (2.2.4) 

where $(t, to) is the state transition matrix defined as a matrix that satisfies 
the linear homogeneous matrix equation 

d ®f t to) = Am(t, to), Hh, to) = I 
For the LTI system (2.2.3), <h(t, to) depends only on the difference t — to, i.e., 

$(t,to) = <h(t-to) = e A{t ~ to) 
and the solution x(t),y(t) of (2.2.3) is given by 

x{t) = e A ^- to) x 0 + f e A ^~ r) Bu{r)dT 
Jt 0 

y(t) = C T x(t ) + Du(t ) (2.2.5) 

where e At can be identified to be 

e At = £~ 1 [(sl- A)' 1 } 

where C~ 1 denotes the inverse Laplace transform and s is the Laplace vari- 
able. 

Usually the matrix D in (2.2.2), (2.2.3) is zero, because in most physical 
systems there is no direct path of nonzero gain between the inputs and 
outputs. 

In this book, we are concerned mainly with LTI, SISO systems with 
D = 0. In some chapters and sections, we will also briefly discuss systems of 
the form (2.2.2) and (2.2.3). 
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2.2.2 Canonical State-Space Forms 

Let us consider the SISO, LTI system 


x = Ax + Bu, x(to) = xo 
y = C T x (2.2.6) 

where x £ lZ n . The controllability matrix P c of (2.2.6) is defined by 

Pc= [B,AB,...,A n ~'B] 

A necessary and sufficient condition for the system (2.2.6) to be completely 
controllable is that P c is nonsingular. If (2.2.6) is completely controllable, 
the linear transformation 

x c = p ~ 1 x (2.2.7) 

transforms (2.2.6) into the controllability canonical form 


0 

0 

... 0 

—ao 

1 

0 

... 0 

—a\ 

o • ■ 

II 

o 

1 

... 0 

-02 

_ 0 

0 

... 1 

On— 1 


y = Cjx c 


1 

0 

x c + 0 u (2.2.8) 

0 


where the af s are the coefficients of the characteristic equation of A, i.e. , 
det(sl — A) = s n + an-is 11 ^ 1 + • • • + do and Cj = C T P c . 

If instead of (2.2.7), we use the transformation 


x c = M~ 1 P~ 1 x (2.2.9) 


where 


' 1 

On— 1 

• • • 02 

Ol 



0 

1 

• • • a 3 

02 


M = 

0 

0 

... i 

On— 1 



_ 0 

0 

... o 

1 
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we obtain the following controller canonical form 


x 


C 


y 


On— 1 

— On- 2 ' ' ' 

— Ol 

— Oo 


' 1 ' 

1 

0 

0 

0 


0 

0 

1 

0 

0 

x c + 

0 

0 

0 

1 

0 


_ 0 _ 


u (2.2.10) 


Cq x c 


where Cq = C T P C M. By rearranging the elements of the state vector x c , 
(2.2.10) may be written in the following form that often appears in books 
on linear system theory 


x 


C 


y 


0 

1 

0 

O' 


' 0 ' 

0 

0 

1 

0 


0 





X c + 


0 

0 


1 


0 

— Oo 

-ai 


On— 2 O n — 1 


_ 1 _ 


u (2.2.11) 



where C\ is defined appropriately. 

The observability matrix P a of (2.2.6) is defined by 


Po 


A 


C T 

C T A 


C T A n ~ l 


(2.2.12) 


A necessary and sufficient condition for the system (2.2.6) to be completely 
observable is that P Q is nonsingular. By following the dual of the arguments 
presented earlier for the controllability and controller canonical forms, we 
arrive at observability and observer forms provided P a is nonsingular [95] , 
i.e., the observability canonical form of (2.2.6) obtained by using the trans- 
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formation x Q = P Q x is 

1 0 0 

0 1 0 

| | x 0 + B 0 u (2.2.13) 

0 ••• 1 
-ai • • • — a n _ 2 — a n -\ 

V = [1,0, . . . ,0]x o 

and the observer canonical form is 

-a n - 1 1 0 

-a n - 2 0 1 

x 0 = \ ] 

— a\ 0 0 

— ao 0 0 

y = [i,o, . . . ,0].T O 

where B 0 ,B\ may be different. 

If the rank of the controllability matrix P c for the nth-order system 
(2.2.6) is less than n, then (2.2.6) is said to be uncontrollable. Similarly, 
if the rank of the observability matrix P Q is less than n, then (2.2.6) is 
unobservable. 

The system represented by (2.2.8) or (2.2.10) or (2.2.11) is completely 
controllable but not necessarily observable. Similarly, the system repre- 
sented by (2.2.13) or (2.2.14) is completely observable but not necessarily 
controllable. 

If the ?rth-order system (2.2.6) is either unobservable or uncontrollable 
then its I/O properties for zero initial state, i.e., xq = 0 are completely 
characterized by a lower order completely controllable and observable system 

X C o — AcoXco T B co u , x co {f o) — 0 

y = Cj 0 x co (2.2.15) 

where x co £ 7 Z nr and n r < n. It turns out that no further reduction in the or- 
der of (2.2.15) is possible without affecting the I/O properties for all inputs. 


0 

Xo + Bw (2.2.14) 

1 
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Figure 2.1 Cart with two inverted pendulums 

For this reason (2.2.15) is referred to as the minimal state-space represen- 
tation of the system to be distinguished from the nonminimal state-space 
representation that corresponds to either an uncontrollable or unobservable 
system. 

A minimal state space model does not describe the uncontrollable or 
unobservable parts of the system. These parts may lead to some unbounded 
states in the nonminimal state-space representation of the system if any 
initial condition associated with these parts is nonzero. If, however, the 
uncontrollable or unobservable parts are asymptotically stable [95], they 
will decay to zero exponentially fast, and their effect may be ignored in 
most applications. A system whose uncontrollable parts are asymptotically 
stable is referred to as st.abilizable, and the system whose unobservable parts 
are asymptotically stable is referred to as detectable [95] . 

Example 2.2.1 Let us consider the cart with the two inverted pendulums shown 
in Figure 2.1, where M is the mass of the cart, mi and m 2 are the masses of the 
bobs, and l\ and l? are the lengths of the pendulums, respectively. Using Newton’s 
law and assuming small angular deviations of |0i|, $ 2 1 , the equations of motions are 
given by 

Mv = —migdi — m.2C/02 + u 

toi(u + hOi) = m.\g6i 
m 2 (v + I 2 O 2 ) = m 2 g 0 2 

where v is the velocity of the cart, u is an external force, and g is the acceleration 
due to gravity. To simplify the algebra, let us assume that mi = m 2 = 1 kg and 
M = 10mi. If we now let x\ = 61 , x 2 = Oi, X 3 = 6 \ — 0 2 , aq = 0\ — 0 2 be the state 
variables, we obtain the following state-space representation for the system: 


x = Ax + Bu 
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where x = [24, X2, £3, 24] T 


0 

1 

0 

0 ' 


0 

1 .2ai 

0 

— O.loi 

0 

, B = 

Pi 

0 

0 

0 

1 

0 

1 .2(ai - ct 2 ) 

0 

o 2 — 0.1(oi — o 2 ) 

0 


1 


and ati = ±,a 2 = f^,P\ = and p 2 = 

The controllability matrix of the system is given by 

P c = [B, AB, A 2 B, A 3 B] 


We can verify that 


det P r = 


( 0 . 011 ) ! 2 g 2 (h - hY 
i\ii 


which implies that the system is controllable if and only if L ^ l 2 . 

Let us now assume that 9 \ is the only variable that we measure, i.e., the mea- 
sured output of the system is 

y = C T x 


where C = [ 1 , 0 , 0 , 0 ] T . The observability matrix of the system based on this output 
is given by 


P n = 


C r 
C T A 
C T A 2 
C T A 3 


By performing the calculations, we verify that 


detP Q = 0.01 

h 


which implies that the system is always observable from y = 9 \. 

When 1 1 = l 2 , the system is uncontrollable. In this case, Oi = a 2 ,P\ = / 3 2 , and 
the matrix A and vector B become 


0 

1 

0 

0 ' 


0 

1.2oi 

0 

—O.loi 

0 

, B = 

Pi 

0 

0 

0 

1 

0 

0 

0 

a 1 

0 


0 


indicating that the control input u cannot influence the states £3,24. It can be 
verified that for £3(0), £4(0) ^ 0, all the states will grow to infinity for all possible 
inputs u. For l\ = l 2 , the control of the two identical pendulums is possible provided 
the initial angles and angular velocities are identical, i.e., $i( 0 ) = 9 2 ( 0 ) and 0 i(O) = 
9 2 ( 0 ), which imply that 2:3(0) = 2:4(0) = 0 . V 
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2.3 Input/Output Models 

2.3.1 Transfer Functions 

Transfer functions play an important role in the characterization of the I/O 
properties of LTI systems and are widely used in classical control theory. 

We define the transfer function of an LTI system by starting with the 
differential equation that describes the dynamic system. Consider a system 
described by the nth-order differential equation 

y^\t)+a n . 1 y^- 1 \t)+- ■ ■ +a 0 y(t ) = b m u^ (t)+b m . 1 u^~ 1 \t )+- • -+b 0 u(t) 

(2.3.1) 

where yM(t) = ^yft), and u^\t) = 4/ u(t)\ u(t ) is the input variable, 
and y(t) is the output variable; the coefficients a*, bj, i = 0, 1 . . . , n — 1, j = 
0, 1, . . . , m are constants, and n and m are constant integers. To obtain the 
transfer function of the system (2.3.1), we take the Laplace transform on 
both sides of the equation and assume zero initial conditions, i.e., 

( s n + a n —is n 1 + • • • + ao)T(s) = ( b m s m + b m —\s m 1 + • • • + bo)U(s) 

where s is the Laplace variable. The transfer function G(s) of (2.3.1) is 
defined as 

r( \ - bmSm + frm-is™- 1 + • • • + fro , 9 , 

(j U(s) S n + On-15"- 1 + • • • + a 0 ( j 

The inverse Laplace g(t) of G(s), i.e., 

S(0 = £ _1 [G(s)] 

is known as the impulse response of the system (2.3.1) and 

y(t) = g(t ) * u(t) 


where * denotes convolution. When u(t) = 5&(t) where 5/\(t) is the delta 
function defined as 


6 a (t) = lim 

e — >0 


l(t) - l(t - e) 


where X(t) is the unit step function, then 


y(t) = g(t) * S A (t) = g(t ) 
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Therefore, when the input to the LTI system is a delta function (often re- 
ferred to as a unit impulse) at t = 0, the output of the system is equal to 
g(t), the impulse response. 

We say that G(s) is proper if G(oo) is finite i.e., n > m\ strictly proper if 
G(oo) = 0 , i.e., n > m; and biproper if n = m. 

The relative degree n* of G(s) is defined as n* = n — m, i.e., n* = degree 
of denominator - degree of numerator of G(s). 

The characteristic equation of the system (2.3.1) is defined as the equation 
s n + a n -\s n 1 + • • • + oo = 0. 

In a similar way, the transfer function may be defined for the LTI system 
in the state space form (2.2.3), i.e., taking the Laplace transform on each 
side of (2.2.3) we obtain 

sX(s)-x(0) = AX(s) + BU(s) 

Y (s) = C T X(s) + DU (s) { ’ 

or 

Y(s) = (c T (s/ - A)~ l B + £>) U{s) + C T {sI - A) _1 a;(0) 

Setting the initial conditions to zero, i.e., x(0) = 0 we get 

Y(s) = G(s)U(s) (2.3.4) 


where 

G(s) = C T (sI - A)~ 1 B + D 

is referred to as the transfer function matrix in the case of multiple inputs 
and outputs and simply as the transfer function in the case of SISO systems. 
We may also represent G(s) as 

a ^- c ~^hw^ +D < 2 - 3 - 5 ' 

where adjQ denotes the adjoint of the square matrix Q £ 1Z nxn . The (i. j) 
element qij of adjQ is given by 

Qij = (-l)* +J det(Q ji ); i,j = 1,2, ...n 
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where Qji € 7^( n-1 ) x (” _1 ) is a submatrix of Q obtained by eliminating the 
j th row and the ith column of the matrix Q. 

It is obvious from (2.3.5) that the poles of G(s) are included in the 
eigenvalues of A. We say that A is stable if all its eigenvalues lie in Refs] < 0 
in which case G(s) is a stable transfer function. It follows that det(s/— A) = 0 
is the characteristic equation of the system with transfer function given by 
(2.3.5). 

In (2.3.3) and (2.3.4) we went from a state-space representation to a 
transfer function description in a straightforward manner. The other way, 
i.e., from a proper transfer function description to a state-space represen- 
tation, is not as straightforward. It is true, however, that for every proper 
transfer function G(s) there exists matrices A, B , C, and D such that 

G(s) = C T {sI - A)~ 1 B + D 


As an example, consider a system with the transfer function 

q/ n _ b m s m + b m -is m 1 + • • • + bo _ Y(s) 
s n + a n -\s n ~ l H b ao U(s) 


where n > m. Then the system may be represented in the controller form 


Un— 1 

®n- 2 ' ' ' 

-ai 

—a o 


' 1 ' 

1 

0 

0 

0 


0 

x = 0 

1 

0 

0 

x + 







0 

0 

0 

1 

0 


_ 0 _ 


y = [ 0 , 0 , ...,b m ,...,bi,b 0 ]x 

or in the observer form 

‘ —a n -i 1 0 ■ ■ ■ 0 1 [ 0 ‘ 

~a n ~ 2 0 1 • • • 0 : 

x = \ \ [ x + b m u 

—a\ 0 0 • • • 1 j 

-a 0 0 0 • • • 0 J 5 0 


(2.3.6) 


(2.3.7) 


y 


[1,0, . . . ,0]x 
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One can go on and generate many different state-space representations 
describing the I/O properties of the same system. The canonical forms in 

(2.3.6) and (2.3.7), however, have some important properties that we will use 
in later chapters. For example, if we denote by (A c , B c , C c ) and (A a , B 0 , C 0 ) 
the corresponding matrices in the controller form (2.3.6) and observer form 

(2.3.7) , respectively, we establish the relations 


[adj(sl - A C )\B C = [ S n -\ . . . , s, 1] T = a n _i(s) (2.3.8) 

Cj adj (si - A 0 ) = [s n ~\ . . . , s, 1] = a^_i(s) (2.3.9) 

whose right-hand sides are independent of the coefficients of G(s). Another 
important property is that in the triples (A c , B c ,C c ) and (A a , B 0 ,C 0 ), the 
n + m+1 coefficients of G(s) appear explicitly, i.e., (A c , B c , C c ) (respectively 
(A a , B 0 , C 0 )) is completely characterized by n + m+ 1 parameters, which are 
equal to the corresponding coefficients of G(s). 

If G(s) has no zero-pole cancellations then both (2.3.6) and (2.3.7) are 
minimal state-space representations of the same system. If G(s) has zero- 
pole cancellations, then (2.3.6) is unobservable, and (2.3.7) is uncontrollable. 
If the zero-pole cancellations of G(s) occur in Re[-s] < 0, i.e., stable poles are 
cancelled by stable zeros, then (2.3.6) is detectable, and (2.3.7) is stabilizable. 
Similarly, a system described by a state-space representation is unobservable 
or uncontrollable, if and only if the transfer function of the system has zero- 
pole cancellations. If the unobservable or uncontrollable parts of the system 
are asymptotically stable, then the zero-pole cancellations occur in Re[s] < 0. 

An alternative approach for representing the differential equation (2.3.1) 
is by using the differential operator 


P(0 


A d(0 

dt 


which has the following properties: 


(i) p(x) = x ; (ii) p(xy) = xy + xy 

where x and y are any differentiable functions of time and x = . 

The inverse of the operator p denoted by p -1 or simply by ^ is defined 
as 

1 a r f 

-(x) = / x(r)dT + x( 0) 

P Jo 


Vi > 0 
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where x(t) is an integrable function of time. The operators p, ^ are related 
to the Laplace operator s by the following equations 


£{p(*)}|*( o)=o = sX ( s ) 

£{^0)} lx(0)=0 = ^(s) 

where C is the Laplace transform and x(t) is any differentiable function of 
time. Using the definition of the differential operator, (2.3.1) may be written 
in the compact form 

R(p)(y) = Z(p){u ) (2.3.10) 

where 

R(p) =p n + 0"n—ip n ^ 4 h a 0 

Z{p) = bmP m + bm-ip™- 1 4 h b 0 

are referred to as the polynomial differential operators [226]. 

Equation (2.3.10) has the same form as 

R(s)Y(s) = Z(s)U(s) (2.3.11) 


obtained by taking the Laplace transform on both sides of (2.3.1) and as- 
suming zero initial conditions. Therefore, for zero initial conditions one can 
go from representation (2.3.10) to (2.3.11) and vice versa by simply replacing 
s with p or p with s appropriately. For example, the system 


may be written as 


y(s) 


s + bo 
s 2 + a 0 


U(s ) 


{p 2 + a 0 ){y) = ( p + b 0 )(u ) 


with y{ 0) = y( 0) = 0, tt(0) = 0 or by abusing notation (because we never 
defined the operator (p 2 4- ao)” 1 ) as 


P + bp 
P 2 4- a 0 

Because of the similarities of the forms of (2.3.11) and (2.3.10), we will use 
s to denote both the differential operator and Laplace variable and express 
the system (2.3.1) with zero initial conditions as 



V = 


Zjs) 

R(s) 


u 


(2.3.12) 
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where y and u denote Y (s) and U(s), respectively, when s is taken to be the 
Laplace operator, and y and u denote y(t ) and u(t), respectively, when s is 
taken to be the differential operator. 

We will often refer to G(s ) = in (2.3.12) as the filter with input u(t) 
and output y(t). 

Example 2.3.1 Consider the system of equations describing the motion of the cart 
with the two pendulums given in Example 2.2.1, where y = 0 1 is the only measured 
output. Eliminating the variables 0 i,0 2 , and d 2 by substitution, we obtain the 
fourth order differential equation 

y I 4 ) — 1.1 (ai + 02 )y^ + 1.2a\a 2 y = Piu^ — a\P 2 u 

where at, /3i,i = 1,2 are as defined in Example 2.2.1, which relates the input u with 
the measured output y. 

Taking the Laplace transform on each side of the equation and assuming zero 
initial conditions, we obtain 

[s 4 - l.l(«i + a 2 )s 2 + 1.2ai a 2 ]Y(s) = {Pis 2 - ai(3 2 )U(s) 

Therefore, the transfer function of the system from u to y is given by 

= Pis 2 - oq/? 2 = x 

U(s) s 4 — l.l(ai + a 2 )s 2 + l.2aia 2 

For 1 1 = l 2 , we have oq = a 2 , (3\ = (3 2 , and 

= Pi(s 2 - Qi) = Pi (s 2 - ax) 
s 4 — 2.2ais 2 + 1.2a 2 (s 2 — a\ )(s 2 — 1.2ai) 

has two zero-pole cancellations. Because a\ > 0, one of the zero-pole cancellations 
occurs in Re[s] > 0 which indicates that any fourth-order state representation of 
the system with the above transfer function is not stabilizable. V 

2.3.2 Coprime Polynomials 

The I/O properties of most of the systems studied in this book are repre- 
sented by proper transfer functions expressed as the ratio of two polynomials 
in s with real coefficients, i.e., 

G(S ) - (2.3.13) 
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where Z(s) = b m s m + b m -\s m ~ l H b bo, R(s ) = s n + a n _is n_1 H b a 0 

and n > m. 

The properties of the system associated with G(s) depend very much 
on the properties of Z(s) and R(s). In this section, we review some of 
the general properties of polynomials that are used for analysis and control 
design in subsequent chapters. 

Definition 2.3.1 Consider the polynomial X(s) = a n s n + a n _is n_1 + • • • + 
no- We sa U that X{s) is monic if a n = 1 and X(s) is Hurwitz if all the 
roots of X(s) = 0 are located in Refs] < 0. We say that the degree of X(s) 
is n if the coefficient a n of s n satisfies a n 0. 

Definition 2.3.2 A system with a transfer function given by (2.3.13) is 
referred to as minimum phase if Z(s) is Hurwitz; it is referred to as stable 
if R(s) is Hurwitz. 

As we mentioned in Section 2.3.1, a system representation is minimal 
if the corresponding transfer function has no zero-pole cancellations, i.e., if 
the numerator and denominator polynomials of the transfer function have 
no common factors other than a constant. The following definition is widely 
used in control theory to characterize polynomials with no common factors. 

Definition 2.3.3 Two polynomials a(s) and b(s) are said to be coprime (or 
relatively prime ) if they have no common factors other than a constant. 

An important characterization of coprimeness of two polynomials is given 
by the following Lemma. 

Lemma 2.3.1 (Bezout Identity) Two polynomials a(s) and b(s) are co- 
prime if and only if there exist polynomials c(s ) and d(s) such that 

c(s)o(s) + d(s)b(s) = 1 

For a proof of Lemma 2.3.1, see [73, 237]. 

The Bezout identity may have infinite number of solutions c(s) and d(s) 
for a given pair of coprime polynomials a(s) and b(s) as illustrated by the 
following example. 
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Example 2.3.2 Consider the coprime polynomials a(s) = s + 1, b(s) = s + 2. Then 
the Bezout identity is satisfied for 

c(s) = s n + 2s n - 1 - 1 , d(s) = -s n - s"" 1 + 1 


and any n > 1. V 

Coprimeness is an important property that is often exploited in control 
theory for the design of control schemes for LTI systems. An important the- 
orem that is very often used for control design and analysis is the following. 

Theorem 2.3.1 If a(s) and b(s ) are coprime and of degree n a and rib, re- 
spectively, where n a > nb, then for any given arbitrary polynomial a*{s ) of 
degree n a * > n a , the polynomial equation 

a(s)l(s) + b(s)p(s) = a*(s ) (2.3.14) 

has a unique solution l(s) and p(s) whose degrees ni and n p , respectively, 
satisfy the constraints n p < n a ,ni < max(n a * — n a ,nb — 1). 


Proof From Lemma 2.3.1, there exist polynomials c(s) and d(s) such that 

a(s)c(s) + b(s)d(s) = 1 (2.3.15) 

Multiplying Equation (2.3.15) on both sides by the polynomial a*(s), we obtain 

a*(s)a(s)c(s) + a* (s)b(s)d(s) = a*(s) (2.3.16) 

Let us divide a*(s)d(s) by a(s), i.e., 

=>■<»)+ 44 

a(s) a(s) 

where r(s) is the quotient of degree n a * + — n a ; n a »,n a , and are the degrees 

of a*(s),a(s), and d(s), respectively, and p(s) is the remainder of degree n p < n a . 
We now use 

a*(s)d(s) = r(s)a(s) + p(s) 
to express the right-hand side of (2.3.16) as 


a*(s)a(s)c(s) + r(s)a(s)b(s) + p(s)b(s) = [a*(s)c(s) + r(s)6(s)]a(s) + p(s)b(s) 
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and rewrite (2.3.16) as 

l{s)a(s) + p(s)b(s) = a*(s) (2.3.17) 

where l(s) = a*(s)c(s) + r(s)b(s). The above equation implies that the degree of 
l(s)a(s ) = degree of (a*(s) — p(s)6(s)) < max{n a . , n p + rib}- Hence, the degree 
of l(s), denoted by ni, satisfies rq < max{n 0 . — n a ,n p + rib — n a }. We, therefore, 
established that polynomials l(s) and p{s) of degree n; < max{n a « — n a ,n p + 
rib — n a } and n p < n a respectively exist that satisfy (2.3.17). Because n p < n a 
implies that n p < n a — 1, the degree rii also satisfies rq < max{n* — n a ,nb — 1}. 
We show the uniqueness of l(s) and p(s) by proceeding as follows: We suppose 
that (h(s) , pi(s)) , ( l 2 (s),P 2 (s )) are two solutions of (2.3.17) that satisfy the degree 
constraints n p < n a ,ni < max{n 0 * — n a ,rib — 1}, i.e., 

a(s)h(s) + b(s)pi(s) = a*(s), a(s)/ 2 (s) + 6(s)p 2 (s) = a*(s) 

Subtracting one equation from another, we have 

a(s)(h(s) - his)) + b(s)(p 1 (s) -p 2 («)) = 0 (2.3.18) 


which implies that 


bjs) _ hjs) - hjs) 
a(s) Pi(s)-P2(s) 


(2.3.19) 


Because n p < n a , equation (2.3.19) implies that b(s),a(s) have common factors that 
contradicts with the assumption that a(s) and b(s) are coprime. Thus, h(s) = h(s) 
and pi(s) = p 2 (s), which implies that the solution l(s) and p{s) of (2.3.17) is unique, 
and the proof is complete. □ 


If no constraints are imposed on the degrees of l(s) and p(s), (2.3.14) has 
an infinite number of solutions. Equations of the form (2.3.14) are referred 
to as Diophantine equations and are widely used in the algebraic design 
of controllers for LTI plants. The following example illustrates the use of 
Theorem 2.3.1 for designing a stable control system. 


Example 2.3.3 Let us consider the following plant 

s — 1 

y = — 3-u 

s 


(2.3.20) 


We would like to choose the input u(t) so that the closed-loop characteristic equation 
of the plant is given by a*(s) = (s + 1) 5 , i.e., u is to be chosen so that the closed-loop 
plant is described by 


(s+ ify = 0 


(2.3.21) 
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Let us consider the control input in the form of 


p(s) 

u = -~rr^y 

l{s) 


(2.3.22) 


where l(s) and p(s) are polynomials with real coefficients whose degrees and coef- 
ficients are to be determined. Using (2.3.22) in (2.3.20), we have the closed-loop 
plant 

s 3 l(s)y = -( s - 1 )p(s)y 


i.e., 

[l(s)s 3 + p(s)(s - 1 )]y = 0 

If we now choose l (s) and p(s) to satisfy the Diophantine equation 


l(s)s 3 + p(s)(s - 1) = (s + l) 5 


(2.3.23) 


then the closed-loop plant becomes the same as the desired one given by (2.3.21). 

Because (2.3.23) may have an infinite number of solutions for l(s),p(s), we 
use Theorem 2.3.1 to choose l(s) and p(s) with the lowest degree. According to 
Theorem 2.3.1, Equation (2.3.23) has a unique solution l(s),p(s) of degree equal to 
at most 2. Therefore, we assume that l(s),p(s) have the form 

1(S) = + llS + Iq 

p(s) =P 2 S 2 +P1S + P0 

which we use in (2.3.23) to obtain the following polynomial equation 

I2S 5 +hs 4 + (l 0 +p 2 )s 3 + (pi-p2)s 2 + (p 0 -pi)s-p 0 = s 5 + 5s 4 -t-10s 3 -l-10s 2 -l-5s-l-l 


Equating the coefficients of the same powers of s on each side of the above equation, 
we obtain the algebraic equations 


' h = l 

h=5 

< ^0 + P2 = 10 
Pi - P2 = 10 
Po - Pi = 5 
-Po = 1 

which have the unique solution of L = 1, Zi = 5, Zq = 26, P 2 = — 16,pi = — 6,po = 
— 1. Hence, 

l(s) = s 2 + 5s + 26, p(s) = —16s 2 — 6s — 1 
and from (2.3.22) the control input is given by 
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u = 


16 s 2 + 6s + 1 
s 2 + 5s + 26 V 


V 


Another characterization of coprimeness that we use in subsequent chap- 
ters is given by the following theorem: 


Theorem 2.3.2 (Sylvester’s Theorem) Two polynomials a(s) = a n s n + 
a n _is n_1 + • • • + ao, b(s) = b n s n + 6 n _is n_1 + ■ ■ • + bo are coprime if and 
only if their Sylvester matrix S e is nonsingular, where S e is defined to be the 
following 2 n x 2 n matrix: 



0 

1 

a l 

ao 

0 

0 


0 

0 

a-n 


0 


0 0 b n 0 0 

0 0 b n -\ b n 0 

• ■ b n — i b n 

0 

a n b\ 

■ a n - 1 bo b\ 

0 b 0 

0 0 

: 0 ■■■ 

ao ai : 

0 ao 0 0 


0 

0 


•• b 0 
0 0 



(2.3.24) 


Proof If Consider the following polynomial equation 

a(s)c(s) + b(s)d(s) = 1 (2.3.25) 

where c(s) = c„_is n_1 +c n _ 2 s ra_2 H bc 0 ,ci(s) = d„__is" _1 +d„_ 2 s" -2 H hd 0 

are some polynomials. Equating the coefficients of equal powers of s on both sides 
of (2.3.25), we obtain the algebraic equation 

SeP = &2n 

where e 2n = [0, 0, . . . , 0, 1] T and 

P — [bn— 1 ? C n — 2 , . . . , Cq , d n — i , d n — 2 , • . . , do] 


(2.3.26) 
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Equations (2.3.25) and (2.3.26) are equivalent in the sense that any solution of 
(2.3.26) satisfies (2.3.25) and vice versa. Because S e is nonsingular, equation (2.3.26) 
has a unique solution for p. It follows that (2.3.25) also has a unique solution for 
c(s) and d(s) which according to Lemma 2.3.1 implies that a(s),b(s) are coprime. 

Only if We claim that if a(s) and 6(s) are coprime, then for all nonzero polynomials 
p(s) and q(s) of degree n p < n and n q < n, respectively, we have 

a(s)p(s) + b(s)q(s) ^ 0 (2.3.27) 

If the claim is not true, there exists nonzero polynomials pi (s) and qi (s) of degree 
n pi < n and n qi < n , respectively, such that 

a(s)pi(s) + b(s)qi(s) = 0 (2.3.28) 

Equation (2.3.28) implies that b(s)/a(s ) can be expressed as 

Kf) = Pi(s) 

a(s) qi{s) 

which, because n Pl < n and n qi < n, implies that a(s) and b(s) have common 
factors, thereby contradicting the assumption that a(s),b(s) are coprime. Hence, 
our claim is true and (2.3.27) holds. 

Now (2.3.27) may be written as 


S e x ± 0 (2.3.29) 

where x € 1Z 2n contains the coefficients of p(s),q(s). Because (2.3.27) holds for all 
nonzero p(s) and q(s) of degree n p < n and n q < n, respectively, then (2.3.29) holds 
for all vectors x £ lZ 2n with x ^ 0, which implies that S e is nonsingular. □ 

The determinant of S e is known as the Sylvester resultant and may be 
used to examine the coprimeness of a given pair of polynomials. If the 
polynomials a(s) and b(s) in Theorem 2.3.2 have different degrees — say nb < 
n a — then b(s) is expressed as a polynomial of degree n a by augmenting it 
with the additional powers in s whose coefficients are taken to be equal to 
zero. 

Example 2.3.4 Consider the polynomials 


a(s) = s 2 + 2s + 1, b(s) = s — 1 = Os 2 + s — 1 
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Their Sylvester matrix is given by 


S e = 


10 0 0 

2 11 0 

12-11 
0 10-1 


Because det/S^ = 4 yf 0, a(s) and b(s) are coprime polynomials. 


V 


The properties of the Sylvester matrix are useful in solving a class of 
Diophantine equations of the form 

l(s)a(s) + p(s)b(s) = a*(s) 

for l(s) and p(s) where a(s),b(s), and a*(s) are given polynomials. 

For example, equation a(s)l(s ) + b(s)p(s) = a*(s) with n a = n,n a * = 
2n — 1, and = m < n implies the algebraic equation 

S e x = f (2.3.30) 

where S e £ 7 Z 2nx2n is the Sylvester matrix of a(s),b(s), and x £ IZ 2n is a 
vector with the coefficients of the polynomials l{s ) and p{s) whose degree 
according to Theorem 2.3.1 is at most n — 1 and / £ IZ 2n contains the 
coefficients of a*(s). Therefore, given a*(s),a(s), and b(s), one can solve 
(2.3.30) for x, the coefficient vector of l(s) and p(s). If a(s), b(s) are coprime, 
5" 1 exists and, therefore, the solution of (2.3.30) is unique and is given by 

x = S-\f 

If a(s),b(s ) are not coprime, then S e is not invertible, and (2.3.30) has a 
solution if and only if the vector / is in the range of S e . One can show through 
algebraic manipulations that this condition is equivalent to the condition 
that a*(s) contains the common factors of a(s) and b(s). 

Example 2.3.5 Consider the same control design problem as in Example 2.3.3, 
where the control input u = is used to force the plant y = ^ -u to satisfy 

(s + 1 ) 5 y = 0. We have shown that the polynomials l(s ) and p(s) satisfy the 
Diophantine equation 


l(s)a(s ) +p(s)6(s) = (s + l) 5 


(2.3.31) 
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1. The corresponding Sylvester matrix S e of a(s) 

1 0 0 0 0 0 ' 

0 1 0 0 0 0 

0 0 11 0 0 

000-11 0 
0 0 0 0 -1 1 

0 0 0 0 0 -1 

Because detS e = —1, we verify that a(s),6(s) are coprime. 

As in Example 2.3.3, we like to solve (2.3.31) for the unknown coefficients 
= 0, 1,2 of the polynomials l(s) = I 2 S 2 + l\s + Iq and p(s) = P 2 S 2 + pis + po- 
By equating the coefficients of equal powers of s on each side of (2.3.31), we obtain 
the algebraic equation 

S e x = f (2.3.32) 


where a(s) = s 3 and b(s) = s — 
and b(s) is 


S e = 


where / = [1,5, 10, 10,5, 1] T and x = [I 2 , h, Io,P 2 ,Pi,Po] T ■ Because S e is a nonsin- 
gular matrix, the solution of (2.3.32) is given by 


x = S~ 1 f=[ 1,5, 26,-16, -6,-1,] t 


which is the same as the solution we obtained in Example 2.3.3 (verify!). V 


2.4 Plant Parametric Models 

Let us consider the plant represented by the following minimal state-space 
form: 

x = Ax + Bu, x(0) = x 0 /o a 

y = C T x [ZAA} 

where x G 7Z n ,u G IZ 1 , and y G 77. 1 and A, B, and C have the appropriate 
dimensions. The triple (A, B , C) consists of n 2 +2n elements that are referred 
to as the plant parameters. If (2.4.1) is in one of the canonical forms studied 
in Section 2.2.2, then n 2 elements of (A,B,C) are fixed to be 0 or l and at 
most 2 n elements are required to specify the properties of the plant. These 
2 n elements are the coefficients of the numerator and denominator of the 
transfer function For example, using the Laplace transform in (2.4.1), 
we obtain 


Y(s) = C T (sI - A)~ l BU (s) + C T (.sI - A)~ 1 xq 
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which implies that 


y( s ) 


Z(s) 

R(s) 


U(s) + 


C T {a,dj(sI-A)}~ 

tji \ x 0 

R{s) 


(2.4.2) 


where R(s) is a polynomial of degree n and Z(s) of degree at most n — 1. 
Setting xq = 0, we obtain the transfer function description 


V = 


Z(s) 

R(s) 


(2.4.3) 


where without loss of generality, we can assume Z(s) and R(s) to be of the 
form 


Z(s) — b n —is n ^ + b n - 2 s n + • • • + b\s + bo (2.4.4) 

R{s) = s n + a n —\s n + • • • + o,\s + ao 


If Z(s) is of degree rn < n— 1, then the coefficients bi, i = n— 1, n— 2, . . . , m+1 
are equal to zero. Equations (2.4.3) and (2.4.4) indicate that at most 2 n 
parameters are required to uniquely specify the I/O properties of (2.4.1). 
When more than 2 n parameters in (2.4.3) are used to specify the same I/O 
properties, we say that the plant is overparameterized. For example, the 
plant 


Z(s)A(s) 
y R{s)A{s) 


(2.4.5) 


where A (s) is Hurwitz of arbitrary degree r > 0, has the same I/O properties 
as the plant described by (2.4.3), and it is, therefore, overparameterized. In 
addition, any state representation of order n+r > n of (2.4.5) is nonminimal. 

For some estimation and control problems, certain plant parameteriza- 
tions are more convenient than others. A plant parameterization that is 
useful for parameter estimation and some control problems is the one where 
parameters are lumped together and separated from signals. In parameter 
estimation, the parameters are the unknown constants to be estimated from 
the measurements of the I/O signals of the plant. 

In the following sections, we present various parameterizations of the 
same plant that are useful for parameter estimation to be studied in later 
chapters. 
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2.4.1 Linear Parametric Models 
Parameterization 1 

The plant equation (2.4.3) may be expressed as an nth-order differential 
equation given by 

y (n) + a n _iy (n-1) H h cioV = b n _ 1 u ( ' n ~ 1 ' ) + 6 n _ 2 it (n-2) H h b 0 u (2.4.6) 

If we lump all the parameters in (2.4.6) in the parameter vector 
b — [bn — i > b n — 2 , . . . ,b(), a n —i, a n —2 , • • • , Q-o] 


and all I/O signals and their derivatives in the signal vector 
= k-l(*)« 5 -«n-l(s)*/] T 


where cti(s) = [s*, s* 1 , . . . , 1] T , we can express (2.4.6) and, therefore, (2.4.3) 
in the compact form 

y {n) = e *T y ( 2 . 4 . 7 ) 

Equation (2.4.7) is linear in 6*, which, as we show in Chapters 4 and 5, is 
crucial for designing parameter estimators to estimate 6* from the measure- 
ments of y l ' n ' 1 and Y . Because in most applications the only signals available 
for measurement is the input u and output y and the use of differentiation 
is not desirable, the use of the signals y ^ and Y should be avoided. One 
way to avoid them is to filter each side of (2.4.7) with an nth - order stable 

filter t7-t to obtain 
A(s) 

z = 6* T (j) (2.4.8) 



and 


A(s) — s n T A n _is n + • • • + Aq 
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is an arbitrary Hurwitz polynomial in s. It is clear that the scalar signal z and 
vector signal 4> can be generated, without the use of differentiators, by simply 
filtering the input u and output y with stable proper filters = 0, 1, . . . n. 

If we now express A(s) as 

A(s) = s 11 + A T a n -i(s) 


where A = [A n _i, A n - 2 , . . . , Aq] t , we can write 


s'- A(s) - A T a n _i(s) xT a n _i(s) 


A(s) V 


A (s) 


y = y- A 


A(s) 


Therefore, 


a n—l ( s ) 

y=z+x - wr v 


Because z = 9* T 4> = 6 * T </> i + 9f fo, where 

9 1 — [bn— 1 > b n —2i • • • , bo] j ^2 — [®n— 1 j dn—2 > • • ■ > ®o] 


i A <a n _i(s) A cr n _i(s) 

91 = 7 i \ 92 = TWA y 

A (s) A(s) 

it follows that 

y = Of <t>i + 9ffo - A t 0 2 

Hence, 

V = 0fcj> (2.4.9) 

where #1) = [9f ,9f — A T ] T . Equations (2.4.8) and (2.4.9) are represented 
by the block diagram shown in Figure 2.2. 

A state-space representation for generating the signals in (2.4.8) and 
(2.4.9) may be obtained by using the identity 


[adj(s/ - A C )]Z = a n -i(s) 

where A c , l are given by 


A c = 

An— 1 

1 

— An-2 • • • 

0 

— Ao 

0 

, 1 = 

1 

0 


1 

O • 

1 

i 

• O 


1 

• o 

1 




Figure 2.2 Plant Parameterization 1. 
which implies that 

det(s/ — A c ) = A(s), (si - A c )-'l = 

Therefore, it follows from (2.4.8) and Figure 2.2 that 

01 = A c <pi + lu, 4 > i G TV 1 

<h = A c 4>2 - ly, (f >2 G TZ n 

y = Oftj) (2.4.10) 

z = y + A T (/>2 = 0 * T 4 > 

Because A(s) = det(s/ — A c ) and A (s) is Hurwitz, it follows that A c is a 
stable matrix. 

The parametric model (2.4.10) is a nonminimal state-space representa- 
tion of the plant (2.4.3). It is nonminimal because 2 n integrators are used 
to represent an nth-order system. Indeed, the transfer function Y(s)/U(s) 
computed using (2.4.10) or Figure 2.2, i.e. , 

Y(s) _ Z(s) A(s) _ Z(s) 

U(s ) ~ R(s)A(s) ~ R(s j 

involves n stable zero-pole cancellations. 

The plant (2.4.10) has the same I/O response as (2.4.3) and (2.4.1) 
provided that all state initial conditions are equal to zero, i.e., xq = 0, 
</>i (0) = 02(0) = 0. In an actual plant, the state x in (2.4.1) may represent 
physical variables and the initial state xq may be different from zero. The 
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effect of the initial state xo may be accounted for in the model (2.4.10) by 
applying the same procedure to equation (2.4.2) instead of equation (2.4.3). 
We can verify (see Problem 2.9) that if we consider the effect of initial con- 
dition xo, we will obtain the following representation 

0 i = A c (f)i + lu , 0i (0) = 0 
02 = A c 02 - ly, 02 ( o) = 0 
y = Ofcfi + r, o (2.4.11) 

2 = y + A t 02 = 0* T (j) + r]o 

where r/o is the output of the system 

ui = A c u, w(0) = LOq 

% = Cjtu (2.4.12) 

where uj G lZ n , uo = BqXq and Co G IZ n ,Bo G , JZ nxn are constant matrices 
that satisfy C l 0 r {adj(sJ — A c )}l?o = C T {adj(s/ — A)}. 

Because A c is a stable matrix, it follows from (2.4.12) that lu, ?/o converge 
to zero exponentially fast. Therefore, the effect of the nonzero initial condi- 
tion xo is the appearance of the exponentially decaying to zero term r/o in 
the output y and z. 

Parameterization 2 

Let us now consider the parametric model (2.4.9) 

y = o* x T <p 

and the identity W m (s)W 1 ~ 1 (s) = 1, where W m (s) = Z m (s)/R m (s ) is a 
transfer function with relative degree one, and Z m (s) and R m (s) are Hurwitz 
polynomials. Because 6\ is a constant vector, we can express (2.4.9) as 

y = w m (s)e^w-\s)ct> 

If we let 

I A 1 T o£-i(g) oj-i (a) 1 

^ ^(s)^ [w m (. S )A( S ) U ’ W m (s)A(s) y _ 
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Figure 2.3 Plant Parameterization 2. 


we have 

y = W m (s)d?il> (2.4.13) 

Because all the elements of are P ro P er transfer functions with 

stable poles, the state ip = [ipj , ip J] T , where 


ipi 


Q! n _l (s) 

W m (s)A(s) 


U,lp2 = 


a n _i(s) 

W m (s)A(s) y 


can be generated without differentiating y or it. The dimension of ip depends 
on the order n of A(s) and the order of Z m (s). Because Z m (s ) can be 
arbitrary, the dimension of ip can be also arbitrary. 

Figure 2.3 shows the block diagram of the parameterization of the plant 
given by (2.4.13). We refer to (2.4.13) as Parameterization 2. In [201], 
Parameterization 2 is referred to as the model reference representation and 
is used to design parameter estimators for estimating 6 1 when W m (s) is a 
strictly positive real transfer function (see definition in Chapter 3). 

A special case of (2.4.13) is the one shown in Figure 2.4 where 


W m {s) 


1 

s + Aq 


and ( s + Ao) is a factor of A(s), i.e., 


A(s) — (s + Ao)Ag(s) — s n + A n _is n 1 + • • • + Ao 


where 


A-q( s ) — S>1 1 + Qn-2S n 2 + • • • + qiS + 1 
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Figure 2.4 Plant Parameterization 2 with A (s) 


(s + Ao)Aq(s) and 


The plant Parameterization 2 of Figure 2.4 was first suggested in [131], 
where it was used to develop stable adaptive observers. An alternative para- 
metric model of the plant of Figure 2.4 can be obtained by first separating 
the biproper elements of 88 follows: 

For any vector c = [c„_i, c n _ 2 , . . . , ci, co] T € 7Z n , we have 

c T a n -i(s) _ c n —is n l c T a n - 2 (s) l‘2A-\A'\ 

A q (s) A q (s) + A q (s) [ - ■ 1 


where c = [c n _ 2 , . . . , ci, co] T , a n -2 = [s n 2 ,...,s,l] T . Because A q (s) = 
s n l +q T a n - 2 (s), where q = [q n - 2 , ■ ■ ■ , qi, 1] T , we have s n_1 = A q (s) - 
q T a n - 2 , which after substitution we obtain 


c T a n _i(s) _ (c - Cn-iqY a n - 2 {s) 

A q (s) ~ Cn ~ 1+ A q {s) 


(2.4.15) 


We use (2.4.15) to obtain the following expressions: 


,,*T a n-l( s ) _ t . n*T a n- 2 (s ) 

6*i t , r U — b n -iU + d 1 — - r Zt 

A q{s) A q (s) 

la * T .T^n-l^) _ (\ \ z*T a n-2( s ) /„ , 1fi 7 

(v 2 ^ ) a / \ V ~ (^n -1 CL n -l)y 0 2 . , . y (2.4.16) 

A q{s) A q(s) 


where 0| T = b — b n -\q : 9 2 T = a — A — (a n _i — A n _i)g and a = [a n _ 2 , 
. . ., ai, a 0 ] T , b = [b n -i, b 0 ] T , A = [A n _ 2 , . . . , Ai, A 0 ] T . Using (2.4.16), 
Figure 2.4 can be reconfigured as shown in Figure 2.5. 
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Figure 2.5 Equivalent plant Parameterization 2. 

A nonminimal state space representation of the plant follows from Fig- 
ure 2.5, i.e. , 

x'i = — Aoxi + #* T V’, x'i € 1Z 1 
'ijj 1 = A c ^i + lu, G 7£ n_1 

= A jfa-Ty, $2 sir- 1 (2.4.17) 

y = x\ 

where 6* = [6 n _ i,0* T ,A n _i -a n _i,0| T ] T , $ = [u, Vq 1 ", y, Y^] 7 " and 


Qn - 2 

CO 

1 

g 

1 

-qo 


' 1 ' 

1 

0 

0 

, i= 

0 





0 

1 

0 


0 


As with Parameterization 1, if we account for the initial condition x(0) = 
xq / 0, we obtain 

xi = — Aoxi + 0* T 'ip, aq(0)=0 

= Ac^i + Tu, V’i(O) = 0 

-02 = A c ^2 - ly , ^ 2 ( 0 ) = 0 (2.4.18) 

y = xi + yo 

where r/o is the output of the system 

w = A c w, w(0) = ujq, lo G 7£ n 
% = Cju 
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where A c , Co, and cjq are as defined in (2.4.12). 


Example 2.4.1 (Parameterization 1) Let us consider the differential equation 

+ a 2 y ( ' 2) + aoy = b 2 u ^ + b 0 u (2.4.19) 

that describes the motion of the cart with the two pendulums considered in Exam- 
ples 2.2.1, 2.3.1, where 


a 2 — — l.l(aq + 02)1 0,0 — 1.2aiQi2; b 2 — 60 — —cz\(3 2 


Equation (2.4.19) is of the same form as (2.4.6) with n = 4 and coefficients 03 = 
ai = &3 = 61 = 0. Following (2.4.7), we may rewrite (2.4.19) in the compact form 

y (4) = g *T Yo (2.4.20) 

where = [ b 2 , bo, a 2 , ao] T , Y 0 = [u^ 2 \u, — y^ 2 \ — y] T . Because y and u are the only 
signals we can measure, yA\Y 0 are not available for measurement. 

If we filter each side of (2.4.20) with the filter where A(s) = (s + 2) 4 = 
s 4 + 8s 3 + 24s 2 + 32s + 16, we have 


z = 


9S T 0 


(2.4.21) 


where z = (a | 2)4 y, <j > 0 


' 


are now signals 


(s+ 2) 4 u ’ (s+ 2) 4U > ~ (s+2) 4 y 1 (s+ 2) 4 y 

that can be generated from the measurements of y and u by filtering. Because in 
(2.4.19) the elements 03 = a\ = & 3 = b\ = 0, the dimension of 0g,</> 0 is 4 instead of 
8, which is implied by (2.4.8). 

Similarly, following (2.4.9) we have 


y = ^ 


(2.4.22) 


where 


9* x = [0, 62, 0, 6 0 , -8, a 2 - 24, -32, a 0 - 16] 

Jr.. . 

,3 „2 inT 


<+[(«) “sO) nT 


-?/ 

.(s + 2) 4 ’ (s + 2) 4 ' 
elements of 9 X that do not depend on the parameters of (2.4.19) to obtain 

V = 05 a </>o + hj(j> 


iV 


^ 3 ( 5 ) [ S 5 S ) 1 ] 


where 9q X = [b 2 , b 0 , a 2 — 24, a 0 — 16] T ,/i 0 = [0, 0, 0, 0, — 8, 0, — 32, 0] T . We obtain a 
state-space representation of (2.4.21) and (2.4.22) by using (2.4.10), i.e., 

c t>i = A C cf>i + lu , ()>i G 1Z 4 

<t >2 = A c 4> 2 - ly, <f >2 G U 4 

y = 9* x r cl ) =9* 0 J^ + hU 

Z = 9q T 4>o 
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where 


A n = 


' -8 


-24 

-32 

-16 ' 



' 1 ' 

1 


0 

0 

0 


l = 


0 

0 


1 

0 

0 

5 


0 

0 


0 

1 

0 



0 


' 0 

1 

0 

0 

0 0 

0 

0 ' 



0 

0 

0 

1 

0 0 

0 

0 


] = 

0 

0 

0 

0 

0 1 

0 

0 

<t> 


0 

0 

0 

0 

0 0 

0 

1 



and <j> = [4>J , <t >2 ]• Instead of (2.4.22), we can also write 

V = 0q T 4>o ~ A t </> 2 


where A = [8, 24, 32, 16] T . 


V 


Example 2.4.2 (Parameterization 2) Consider the same plant as in Exam- 
ple 2.4.1, i.e., 

y = o* x T <t> 

where = [0, 62, 0, bo, —8, a 2 — 24, —32, ao — 16], 




» 


-,T 


Now we write 


where 


/ A 
= 


Using simple algebra, we have 


(s + 2) 4 ’ (s + 2) 4 

a 3 ( s ) (s) 


-,T 


rU, 


(s + 2)3 ’ (s + 2)3‘ 


o 3 (s) _ 1 

(s + 2) 3 _ (s + 2) 3 


' s 3 ' 


" 1 ' 


' -6 

-12 

-8 ' 

s 2 


0 

1 

1 

0 

0 

s 

— 

0 

+ (s + 2) 3 

0 

1 

0 

1 


0 


0 

0 

1 


o 2 (s) 


where 02(a) = [s 2 ,s, 1] T . Therefore, ip can be expressed as 


= 


0:2(5) 


u-\ l u , -y + A T 7 a2 ^y, 


o 2 (s) 




(s + 2) 3 ’(s + 2) 3 


(s + 2) 3 (s + 2) 3 ' 
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where A = [6, 12, 8] , and 9 X i p can be expressed as 


9* x T ip = 9*' ip 


*T„ 


where 


9* = [b 2 , 0, b 0 , 8, a 2 + 24, 64, a 0 + 48] _ 


iP = 


Q^O) 

(s + 2)3 


■ u,y, 


q^O) 

(s + 2)3 


Therefore, 


V = 


1 9* r jj 


s T 2 


A state-space realization of (2.4.24) is 

x\ = —2x\ + 9* ' ip, Xi £ VA 

ip i = A c ipi + lu, 4>i G V? 

i> 2 = K c ip 2 - ly, 4> 2 G R 3 

y = x i 

where ip = [ipj , y, ipj ] T , 


A„ = 


' -6 

-12 

-8 ' 


' i ' 

1 

0 

0 

, i = 

0 

0 

1 

0 


0 


(2.4.23) 


(2.4.24) 


V 


2.4.2 Bilinear Parametric Models 

Let us now consider the parameterization of a special class of systems ex- 
pressed as 

y = k oj^f\ u (2.4.25) 

R 0 {s) 

where fco is a scalar, Rq(s) is rnonic of degree n, and Zq(s) is rnonic and 
Hurwitz of degree m < n. In addition, let Zq(s) and Rq(s) satisfy the 
Diophantine equation 

k 0 Z 0 (s)P(s) + R 0 {s)Q(s) = Zq(s)A(s) (2.4.26) 


where 


Q(s) = s n 1 + q T a n - 2 (s ) 
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P(s) = p T a n -\{s) 

<**(») = 

q £ 1Z n ~ l ,p G 7Z n are the coefficient vectors of Q(s) — s n ~ 1 , P(s ), respec- 
tively, and H(s) is a rnonic Hurwitz polynomial of degree 2 n — m — 1. The 
Diophantine equation (2.4.26) relating Zq(s), Rq(s), ko to P(s),Q(s), and 
.A(s) arises in control designs, such as model reference control, to be dis- 
cussed in later chapters. The polynomials P(s) and Q(s) are usually the 
controller polynomials to be calculated by solving (2.4.26) for a given H(s). 
Our objective here is to obtain a parameterization of (2.4.25), in terms of 
the coefficients of P(s) and Q(s), that is independent of the coefficients of 
Zq(s) and Ro(s). We achieve this objective by using (2.4.26) to eliminate 
the dependence of (2.4.25) on Zo(s) and Ro(s) as follows: 

From (2.4.25), we obtain 

Q(s)R 0 (s)y = k 0 Z 0 (s)Q(s)u (2.4.27) 

by rewriting (2.4.25) as Ro(s)y = koZo(s)u and operating on each side by 
Q(s). Using Q(s)Ro(s) = Zq(s)(A(s ) — koP(s)) obtained from (2.4.26) in 
(2.4.27), we have 


Z 0 (s)(A(s) - k 0 P{s))y = k 0 Z 0 (s)Q(s)u (2.4.28) 

Because Zq(s) is Hurwitz, we filter each side of (2.4.28) by obtain 

A(s)y = k 0 P{s)y + k 0 Q{s)u (2.4.29) 

and write (2.4.29) as 

A(s)y = k 0 \p T a n -i{s)y + q T a n - 2 {s)u + s n_1 «] (2.4.30) 


We now have various choices to make. We can filter each side of (2.4.30) 
with the stable filter -rbr and obtain 


y = k 0 


T a n— 1 

P 


T a n-2{s) S 


,n— 1 


Ms) 


-U + 


A(s) 


which may be written in the compact form 


y = k 0 (6* T (j) + z 0 ) 


(2.4.31) 
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where 9* = [q T ,p T ] T ,4> 


a I-A s ) 

Ms) 


u, 


a i[-lW 
a(s) y 


lT 


and zq 


Ms) 


u. 


We can 


also filter each side of (2.4.30) using an arbitrary stable filter whose 
order ri\ satisfies 2re — m — 1 > n\ > n — 1 to obtain 


y = W (s)k 0 (9* T (j) + z Q ) 


(2.4.32) 


where now 


«I-2( S ) 

AW 


u, 


a J-iW 

AW 



Zo 


Ms) 


u, and W(s) 


AW 

AW 


is a 


proper transfer function. 

In (2.4.31) and (2.4.32), (j> and zq may be generated by filtering the input 
u and output y of the system. Therefore, if u and y are measurable, then 
all signals in (2.4.31) and (2.4.32) can be generated, and the only possible 
unknowns are ko and 9*. If k q is known, it can be absorbed in the signals (j) 
and z o, leading to models that are affine in 9* of the form 


y = W(s)d* T <f> 


(2.4.33) 


where y = y — W(s)koz , 4> = ko4>. If ko, however, is unknown and is part 
of the parameters of interest, then (2.4.31) and (2.4.32) are not affine with 
respect to the parameters ko and 9*, but instead, ko and 9* appear in a 
special bilinear form. For this reason, we refer to (2.4.31) and (2.4.32) as 
bilinear parametric models to distinguish them from (2.4.7) to (2.4.9) and 
(2.4.33), which we refer to as linear parametric or affine parametric models. 
The forms of the linear and bilinear parametric models are general enough 
to include parameterizations of some systems with dynamics that are not 
necessarily linear, as illustrated by the following example. 


Example 2.4.3 Let us consider the nonlinear scalar system 

x = do f(x, t) + bog(x, t) + CqU (2.4.34) 

where ao,bo, and cq are constant scalars; f(x,t) and g(x,t) are known nonlinear 
functions that can be calculated at each time t: and u, x is the input and state of 
the system, respectively. We assume that /, g, and u are such that for each initial 
condition x(0) = xo, (2.4.34) has only one solution defined for all t € [0, oo). If x and 
u are measured, (2.4.34) can be expressed in the form of parametric model (2.4.33) 
by filtering each side of (2.4.34) with a stable strictly proper transfer function Wf(s), 
i.e., 


z = w f (s)e* T <j) 


(2.4.35) 
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where z = sWf(s)x, 0* = [ao,&OjCo] T , and <j> = [f{x,t),g(x,t),u] T . Instead of 
(2.4.35), we may also write (2.4.34) in the form 

x = —a m x + a m x + 9 * t <f> 


for some a m > 0, or 


Then 


1 


x = 


s + a r 


-[ a m x + 9 


*T , 


A 

Z = X — 


(Ir 


1 


-X = 


e* T (t) 


s + a m s + a n 


(2.4.36) 


which is in the form of (2.4.35) with Wf(s) = s+ 1 a . We may continue and rewrite 
(2.4.35) (respectively (2.4.36)) as 


z = 0* T 4>f, <f>f = W f {s)(t> 
which is in the form of (2.4.8). 


(2.4.37) 

V 


The nonlinear example demonstrates the fact that the parameter 6* ap- 
pears linearly in (2.4.35) and (2.4.37) does not mean that the dynamics are 
linear. 


2.5 Problems 


2.1 Verify that x(t) and y(t) given by (2.2.4) satisfy the differential equation (2.2.2). 

2.2 Check the controllability and observability of the following systems: 


(a) 


(b) 


x 

y 


— 0.2 

-1 

[- 1 , 1 ]* 


0 

0.8 


x + 


1 

1 


u 


' -1 

1 

0 


' 0 ' 

0 

-1 

0 

x + 

1 

0 

0 

-2 


1 


= [ 1 , 1 , 1 ]* 


y 



62 


CHAPTER 2. MODELS FOR DYNAMIC SYSTEMS 



2.3 Show that ( A , B) is controllable if and only if the augmented matrix [si — A, B] 

is of full rank for all s £ C. 

2.4 The following state equation describes approximately the motion of a hot air 

balloon: 



p[0 0 l]x 

where X \ : the temperature change of air in the balloon away from the equilib- 
rium temperature; : vertical velocity of the balloon; x 3: change in altitude 
from equilibrium altitude; u: control input that is proportional to the change 
in heat added to air in the balloon; w: vertical wind speed; and a, Ti,t 2 are 
parameters determined by the design of the balloon. 



(a) Let w = 0. Is the system completely controllable? Is it completely 

observable? 

(b) If it is completely controllable, transform the state-space representation 
into the controller canonical form. 

(c) If it is completely observable, transform the state-space representation 

into the observer canonical form. 
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(d) Assume w = constant. Can the augmented state x a = [a; T ,w] T be 
observed from yl 

(e) Assume u = 0. Can the states be controlled by «,'? 

2.5 Derive the following transfer functions for the system described in Problem 2.4: 

(a) Gi(s) == when w = 0 and y = x 3 . 

(b) G 2 (s) = when u = 0 and y = x 3 . 

(c) G 3 (s) = when w = 0 and y 1 = x l . 

(d) G 4 (s) = when u = 0 and y 1 = aq. 

2.6 Let a(s) = (s + cc) 3 , b(s) = /3, where a, (3 are constants with /3 0. 

(a) Write the Sylvester matrix of a(s ) and b(s). 

(b) Suppose Po(s),Iq(s) is a solution of the polynomial equation 

a(s)l(s) + b(s)p(s) = 1 (2.5.1) 

Show that (pi(s),li(s)) is a solution of (2.5.1) if and only if pi(s),li(s) 
can be expressed as 


Pi{s) = Po{s) + r{s)a(s ) 
h{s) = l 0 (s) - r(s)b(s) 

for any polynomial r(s). 

(c) Find the solution of (2.5.1) for which p(s) has the lowest degree and 
p(s)/l(s) is a proper rational function. 

2.7 Consider the third order plant 


y = G(s)u 

where 

. b 2 s 2 + hs + bo 

G(s) — -3— y— — — 

s J + a 2 s z + ais + a 0 

(a) Write the parametric model of the plant in the form of (2.4.8) or (2.4.13) 

when 9* = [b 2 , 61, 60, a 2 , 01, ao] T - 

(b) If oo,ai, and a 2 are known, i.e., ao = 2,a\ = 1, and a 2 = 3, write a 
parametric model for the plant in terms of 9* = [62, &i 7 ^o] T - 

(c) If bo, bi, and b 2 are known, i.e., bo = 1, b\ = b 2 = 0, develop a parametric 

model in terms of 9* = [a2,ai,ao] T . 
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2.8 Consider the spring- mass-dashpot system shown below: 

u 


I 



where k is the spring constant, f the viscous-friction or damping coefficient, 
m the mass of the system, u the forcing input, and x the displacement of 
the mass M. If we assume a “linear” spring, i.e. , the force acting on the 
spring is proportional to the displacement, and a friction force proportional 
to velocity, i.e., x , we obtain, using Newton’s law, the differential equation 

Mx = u — kx — fx 

that describes the dynamic system. 

(a) Give a state-space representation of the system. 

(b) Calculate the transfer function that relates x with u. 

(c) Obtain a linear parametric model of the form 



where 0* = [. M , k, f] T and z, <fi are signals that can be generated from 
the measurements of u,x without the use of differentiators. 

2.9 Verify that (2.4.11) and (2.4.12) are nonminimal state-space representations 

of the system described by (2.4.1). Show that for the same input u(t), the 
output response y(t) is exactly the same for both systems. (Hint: Verify that 
Cq [adj(sl — A C )]B 0 = C T [adj(sI — H)] for some Co € TZ n ,Bo € 7 Z nxn by 
using the identity 

[adj(si-A)] = s” -1 / + s n ~ 2 (A + a n -J) + s n ~ 3 (A 2 + a n _iH + a„_ 2 /) 

+ • • • + (H n_1 + a^A ™- 2 + ■■■ + ai I) 

and choosing Co such that (Co,A c ) is an observable pair.) 

2.10 Write a state-space representation for the following systems: 
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(a) (j) = a ’X~(s) Ji ^ tt ’ A(s) mon i c °f order n. 

(b) (j) = Xtlf u -> Ai(s) i s m onic of order n — 1. 

(c) (j) = m < n — 1, Ai(s) is monic of order n — 1. 

2.11 Show that 

( S j - a,)- 1 ; = (cj (si - a 0 )- 1 ) t = 

where (A c ,l) is in the controller form and (C 0 , A„) is in the observer form. 

2.12 Show that there exists constant matrices Qi € TJ.f"- 1 )*!"- 1 ) such that 

( T A \ — 1 J _ r\ a n-2( s ) . _ . „ 

-A-0 ) di Cji . , x , Z 1, 2, . . . , Tl 

a (s) 


where c?i = — A; A(s) = s n 1 + A T a„_ 2 (s) = det (si — A 0 ), A 0 = 


In - 2 
0 


di = [0, . . . , 0, 1, 0, . . . , 0] T € TZ" 1 whose (* — l)th element is equal to 1, and 
i = 2, 3, . . . , n. 



Chapter 3 


Stability 


3.1 Introduction 

The concept of stability is concerned with the investigation and characteri- 
zation of the behavior of dynamic systems. 

Stability plays a crucial role in system theory and control engineer- 
ing, and has been investigated extensively in the past century. Some of 
the most fundamental concepts of stability were introduced by the Rus- 
sian mathematician and engineer Alexandr Lyapunov in [133]. The work of 
Lyapunov was extended and brought to the attention of the larger control 
engineering and applied mathematics community by LaSalle and Lefschetz 
[124, 125, 126], Krasovskii [107], Hahn [78], Massera [139], Malkin [134], 
Kalman and Bertram [97], and many others. 

In control systems, we are concerned with changing the properties of dy- 
namic systems so that they can exhibit acceptable behavior when perturbed 
from their operating point by external forces. The purpose of this chapter is 
to present some basic definitions and results on stability that are useful for 
the design and analysis of control systems. Most of the results presented are 
general and can be found in standard textbooks. Others are more specific 
and are developed for adaptive systems. The proofs for most of the general 
results are omitted, and appropriate references are provided. Those that are 
very relevant to the understanding of the material presented in later chapters 
are given in detail. 

In Section 3.2, we present the definitions and properties of various norms 
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and functions that are used in the remainder of the book. The concept of I/O 
stability and some standard results from functional analysis are presented 
in Section 3.3. These include useful results on the I/O properties of linear 
systems, the small gain theorem that is widely used in robust control design 
and analysis, and the £ 2 < 5 - n orm and Bellman- Gronwall (B-G) Lemma that 
are important tools in the analysis of adaptive systems. The definitions of 
Lyapunov stability and related theorems for linear and nonlinear systems are 
presented in Section 3.4. The concept of passivity, in particular of positive 
real and strictly positive real transfer functions, and its relation to Lyapunov 
stability play an important role in the design of stable adaptive systems. 
Section 3.5 contains some basic results on positive real functions, and their 
connections to Lyapunov functions and stability that are relevant to adaptive 
systems. 

In Section 3.6, the focus is on some elementary results and principles that 
are used in the design and analysis of LTI feedback systems. We concentrate 
on the notion of internal stability that we use to motivate the correct way of 
computing the characteristic equation of a feedback system and determining 
its stability properties. The use of sensitivity and complementary sensitiv- 
ity functions and some fundamental trade-offs in LTI feedback systems are 
briefly mentioned to refresh the memory of the reader. The internal model 
principle and its use to reject the effects of external disturbances in feedback 
systems is presented. 

A reader who is somewhat familiar with Lyapunov stability and the basic 
properties of norms may skip this chapter. He or she may use it as reference 
and come back to it whenever necessary. For the reader who is unfamiliar 
with Lyapunov stability and I/O properties of linear systems, the chapter 
offers a complete tutorial coverage of all the notions and results that are 
relevant to the understanding of the rest of the book. 


3.2 Preliminaries 

3.2.1 Norms and C p Spaces 


For many of the arguments for scalar equations to be extended and remain 
valid for vector equations, we need an analog for vectors of the absolute value 
of a scalar. This is provided by the norm of a vector. 
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Definition 3.2.1 The norm \x\ of a vector x is a real valued function with 
the following properties: 

(i) |x| > 0 with |z| = 0 if and only if x = 0 

(ii) \ax\ = |a||a;| for any scalar a 

(iii) \x + y\ < \x\ + \y\ (triangle inequality) 

The norm |x| of a vector x can be thought of as the size or length of the 
vector x. Similarly, \x — y\ can be thought of as the distance between the 
vectors x and y. 

An m x n matrix A represents a linear mapping from ?r-dimensional 
space IZ n into m-dimensional space 7Z m . We define the induced norm of A 
as follows: 

Definition 3.2.2 Let | • | be a given vector norm. Then for each matrix 
A € 7Z' nxn , the quantity ||A|| defined by 

n , n A I Ax\ i . i ,,, 

||A|| = sup — — j- = sup \Ax\ = sup \Ax\ 

|®| b|<l |a;| = l 

x£TZ n 

is called the induced (matrix) norm of A corresponding to the vector 
norm \ ■ |. 

The induced matrix norm satisfies the properties (i) to (iii) of Definition 
3.2.1. 

Some of the properties of the induced norm that we will often use in this 
book are summarized as follows: 

(i) \Ax\ < ||A|||x|, \/x£lZ n 

(ii) ||A + B||<||A|| + ||B|| 

(iii) : flAB|| < P||||S|| 

where A, B are arbitrary matrices of compatible dimensions. Table 3.1 shows 
some of the most commonly used norms on TZ n . 

It should be noted that the function ||A|| S = rnaxjj \a t] \, where A e 
IZmxn an j a . . j g (j ; j) element of A satisfies the properties (i) to (iii) 
of Definition 3.2.1. It is not, however, an induced matrix norm because no 
vector norm exists such that || • || s is the corresponding induced norm. Note 
that || • || s does not satisfy property (c). 
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Table 3.1 Commonly used norms 


Norm on IZ n 

Induced norm on 7 Z mxn 

|x|oo =m axj \xi\ (infinity norm) 

Mil = E i Mil 

Mb = (EiMil 2 ) 1/2 

(Euclidean norm) 

\\A oo = max,; E j Wij | (row sum) 

T ||i = rnaxj Ei \ a ij I (column sum) 
l\A\\ 2 = [A m (^ T 2l)] 1 / 2 , where A m (M) 
is the maximum eigenvalue of M 


Example 3.2.1 (i) Let x = [1, 2, — 10, 0] T . Using Table 3.1, we have 


Moo - 10, Ml = 13, M 2 = M105 


(ii) Let 


A = 


0 5 

1 0 

0 -10 


Using Table 3.1, we have 


B = 


-1 5 
0 2 


Mil! = 15, P|| 2 = 11.18, Halloo = 10 

IMHi = 7, ||B|| 2 = 5.465, ||B||oo = 6 

PB||!=35, \\AB\\ 2 = 22.91, ||TB||oo=20 


which can be used to verify property (iii) of the induced norm. 


V 


For functions of time, we define the T p norm 


x 




i Ip 


for p e [l,oo) and say that x G T p when ||x|| P exists (i.e. , when ||x|| p is 
finite). The Too norm is defined as 


= sup MM) I 
t> 0 


and we say that x 6 Too when ||x||oo exists. 

In the above T p , Too norm definitions, x(t) can be a scalar or a vector 
function. If x is a scalar function, then | • | denotes the absolute value. If x 
is a vector function in IZ n then | • | denotes any norm in IZ n . 
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Similarly, for sequences we define the l p norm as 

/ °° \ Vp 

IMIp = ( \ Xi \ P ) ’ 1 <P< 00 

and the norm as 

n n A . . 

Halloo = SUp \Xi\ 
i> 1 

where x = (. xi,X 2 , • . .) and Xi G TZ. We say x G l p (respectively x G Zoo) if 
||x|| p (respectively HxHoo) exists. 

We are usually concerned with classes of functions of time that do not 
belong to C p . To handle such functions we define the C pe norm 

jM|p = |x(r)| p dT^ P 

for p G [l,oo) and say that x G C pe when ||x*|| p exists for any finite t. 
Similarly, the norm is defined as 

\\xt\\oo = sup |x(r)| 

0 <T<t 

The function t 2 does not belong to C p but t 2 G C pe . Similarly, any continuous 
function of time belongs to C pe but it may not belong to C p . 

For each p G [l,oo], the set of functions that belong to C p (respectively 
C pe ) form a linear vector space called C p space (respectively C pe space) [42], 
If we define the truncated function ft as 

Mr) £ \ f(T) 0 S T S * 

JtK ’ I 0 T > t 


for all t G [0, 00 ), then it is clear that for any p G [1, 00 ], / G C pe implies 
that ft G C p for any finite t. The C pe space is called the extended C p space 
and is defined as the set of all functions / such that ft G C p . 

It can be easily verified that the C p and C pe norms satisfy the properties 
of the norm given by Definition 3.2.1. It should be understood, however, 
that elements of C p and C pe are equivalent classes [42], i.e. , if /, g G C p and 
11/ — d\\p = 0) the functions / and g are considered to be the same element 
of C p even though fit) g(t) for some values of t. The following lemmas 
give some of the properties of C p and C pe spaces that we use later. 
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Lemma 3.2.1 (Holder’s Inequality) If p, q £ [1 , oo] and 1 + ^ = 1, then 
f £ C p , g £ C q imply that fg £ C\ and 

\\fg\\i<\\f\\p\\g\\ q 

When p = q = 2, the Holder’s inequality becomes the Schwartz inequal- 
ity , i.e., 

WfsWi < Wfhhh (3-2.1) 

Lemma 3.2.2 (Minkowski Inequality) For p £ [l,oo], f,g £ C p imply 
that f + g £ C p and 

11/ + g\\ P < \\f\\p + Hsllp (3.2.2) 

The proofs of Lemma 3.2.1 and 3.2.2 can be found in any standard book on 
real analysis such as [199, 200]. 

We should note that the above lemmas also hold for the truncated func- 
tions f t ,gt of f,g, respectively, provided f,g £ C pe . For example, if / and 
g are continuous functions, then f,g £ C pe , i.e., ft,gt £ C p for any finite 
t £ [0, oo) and from (3.2.1) we have ||(/y) t ||i < \\f t Ihllstlh, i.e., 

J o \f(j)g{r)\dT < ^ |/(r)| 2 dr^ ^ |c/(r)| 2 dr^ (3.2.3) 

which holds for any finite t > 0. We use the above Schwartz inequality 
extensively throughout this book. 



o 


V 
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Example 3.2.3 Consider the functions 

f(t) = i + t, g( t ) = Y~ft' for ^ — 0 

It is clear that / ^ C p for any p € [l,oo] and g £ C\. Both functions, however, 
belong to C pe \ and can be used to verify the Schwartz inequality (3.2.3) 

ll(/0)t||l < Wfthhth 


i.e., 

J 1 dr < (/ (l + r) 2 dr 

for any t £ [0, oo) or equivalently 

t < + 3t + 3) 



1 




which is true for any t > 0. 


V 


In the remaining chapters of the book, we adopt the following notation 
regarding norms unless stated otherwise. We will drop the subscript 2 from 
I • 1 2 5 || • 1 1 2 when dealing with the Euclidean norm, the induced Euclidean 
norm, and the £2 norm. If x : 1Z + 1 — > TZ n , then 

\x(t)\ represents the vector norm in TZ n at each time t \\xt\\ p repre- 
sents the C pe norm of the function \x(t)\ ||x|| p represents the C p norm of 
the function |x(f)| 

If A G 7Z mxn , then 

PH* represents the induced matrix norm corresponding to the vector 
norm | • |j. 

If A : IZ + 1 — > iz mxn has elements that are functions of time t, then 

||7l(f)||j represents the induced matrix norm corresponding to the vector 
norm | • |* at time t. 

3.2.2 Properties of Functions 

Let us start with some definitions. 

Definition 3.2.3 (Continuity) A function f : [0,oo) ^>IZ is continuous 

on [0,oo) if for any given eo > 0 there exists a 5(eo,to) such that Vto,t £ 
[ 0 , 00) for which \t — to I < 5(eoj to) we have \f(t) — /’(to) | < e o- 
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Definition 3.2.4 (Uniform Continuity) A function f : [0,oo) i— > IZ is 
uniformly continuous on [0,oo) if for any given eo > 0 there exists a 5(e o) 
such that Vto, t £ [0, oo) for which \t — to I < 5(eo) we have \ f(t) — /(io)| < eo- 

Definition 3.2.5 (Piecewise Continuity) A function f : [0,oo) ^ IZ is 
piecewise continuous on [0, oo) if f is continuous on any finite interval 
[to,ii] C [0,oo) except for a finite number of points. 

Definition 3.2.6 (Absolute Continuity) A function f : [a, b] >—>■ IZ is 
absolutely continuous on [a, b] iff, for any eo > 0, there is a 5 > 0 such that 

n 

1 /("*) “ /(A) I < eo 

1=1 

for any finite collection of subintervals (ck*. Pi) of [a, b] with Ya = i \ a i~ Pi\ < £• 

Definition 3.2.7 (Lipschitz) A function f : [a,b\ — ► IZ is Lipschitz on 

[a, 6] if \ f (xi) — f (x 2 )\ < k\x\ — X 2 \ Vxi,X 2 £ [a, b\, where k > 0 is a constant 
referred to as the Lipschitz constant. 

The function fit ) = sin(|) is continuous on (0, oo), but is not uniformly 
continuous (verify!). 

A function defined by a square wave of finite frequency is not continuous 
on [0,oo), but it is piecewise continuous. 

Note that a uniformly continuous function is also continuous. A function 
/ with / £ Coo is uniformly continuous on [0,oo). Therefore, an easy way 
of checking the uniform continuity of fit) is to check the boundedness of /. 
If / is Lipschitz on [a, b], then it is absolutely continuous. 

The following facts about functions are important in understanding some 
of the stability arguments which are often made in the analysis of adaptive 
systems. 

Fact 1 lim t ^oo f(t) = 0 does not imply that fit) has a limit as t — ► oo. 

For example, consider the function fit) = sin(\/l + 1). We have 

• _ cos v/1 + t 

* ~ TTT 


0 as t — > oo 
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but f(t ) has no limit. Another example is /(f) = \/l + t sin(ln(l + f)), which 
is an unbounded function of time. Yet 


m 


sin(ln(l+f)) cos(ln(l + t)) ^ 

2\/l -\-t \/l + t 


as t 


oo 


Fact 2 lirn^oo f(t ) = c for some constant c &1Z does not imply that f{t) — > 
0 as t — > oo. 

For example, the function /(t) = tends to zero as t — » oo for any 

finite integer n but 

/ = - 8 ‘° i ( 1 + + t ) t 2 ) ’‘ + + *)” -2 C 0 S (1 + ()” 

has no limit for n > 2 and becomes unbounded as t — > oo for n > 2. 

Some important lemmas that we frequently use in the analysis of adaptive 
schemes are the following: 

Lemma 3.2.3 The following is true for scalar-valued functions: 

(i) A function f(t) that is bounded from below and is nonincreasing has a 
limit as t. — > oo. 

(ii) Consider the nonnegative scalar functions f(t), g(t) defined for all t> 
0. If f(t) < g(t) Vf > 0 and g £ C v , then f £ C p for all p £ [1, oo]. 

Proof (i) Because / is bounded from below, its infimum f m exists, i.e., 

fm= <jn| fit) 

0<t<oo 

which implies that there exists a sequence {t n } £ 1Z + such that liuin^oo f(t n ) = f m . 
This, in turn, implies that given any e 0 > 0 there exists an integer N > 0 such that 

\f(t n ) ~ /m| < eo, V?i > N 

Because / is nonincreasing, there exists an no > N such that for any t. > t no and 
some no > N we have 

fit) < /(tn 0 ) 

and 

I fit) - /m| < | /(tno) - /ml < e 0 

for any t >t no . Because eo > 0 is any given number, it follows that lim^oo /(f) = 
fm ■ 
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(ii) We have 

z (t) = (/ / p (r)dr^ < (/ g p {r)d,T^ < oo, Vi > 0 

Because 0 < z(t) < oo and z(t) is nondecreasing, we can establish, as in (i), that 
z(t) has a limit, i.e., Hindoo z{t) = z < oo, which implies that f £ C p . For p = oo, 
the proof is straightforward. □ 

Lemma 3.2.3 (i) does not imply that / £ Coo. For example, the function 
f{t) = j with t £ (0,oo) is bounded from below, i.e., /(f) > 0 and is 
nonincreasing, but it becomes unbounded as t — ► 0. If, however, /(0) is 
finite, then it follows from the nonincreasing property /(f) < /( 0) Vt > 0 
that / £ £oo. A special case of Lemma 3.2.3 that we often use in this book 
is when / > 0 and f < 0. 

Lemma 3.2.4 Let /, V : [0,oo) i— > IZ. Then 

V < —aV + /, Vt > to > 0 


implies that 

V(t) < e- a ^- to) V{t 0 ) + e- a(f - T) /(r)dr, Vt > t 0 > 0 

"to 

/or any /mife constant a. 

Proof Let tu(f) = L + aL — /. We have uj(t) < 0 and 

V = — aV + f + w 

implies that 

V{t) = e-^-^Vito) + f e- a( ‘- T) /(r)dr+ f e" a(t - T) u;(r)dT 

Jt 0 A 0 

Because w(t) < 0 Vt > to > 0, we have 

V(t) < e- a( ‘-*°V(t 0 )+ [ e- a(f - T) /(r)dr 
Jt 0 


□ 
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Lemma 3.2.5 If f, f € Coo and, f G C p for some p E [1, oo), then f(t) — > 0 
as t —> oo. 

The result of Lemma 3.2.5 is a special case of a more general result given 
by Barbalat’s Lemma stated below. 

Lemma 3.2.6 (Barbalat’s Lemma [192]) If lim^oo /q f(r)dr exists 
and is finite, and f(t) is a uniformly continuous function, then lim^oo m = 
0. 


Proof Assume that lim^oc, /(f) = 0 does not hold, i.e. , either the limit does not 
exist or it is not equal to zero. This implies that there exists an eo > 0 such that 
for every T > 0, one can find a sequence of numbers f* > T such that |/(tj)| > eo 
for all i. 

Because / is uniformly continuous, there exists a number <5(eo) > 0 such that 
I fit) - /(f»)| < j for every t € + <5(e 0 )] 


Hence, for every t € [ti,L + <5(eo)], we have 


\m\ = 
> 


i fit) - fie) + f{ti ) i > i/(f*)i - 1 f(t) - f(u ) i 
eo e 0 
£ ° 2 ~ 2 


which implies that 


(■t,-+(5(eo) 


f(r)di 


fti+S{e 0 ) 


\f( T ) \ dr > 


eo^(eo) 


(3.2.4) 


where the first equality holds because /(f) retains the same sign for f € [fj, fi +<5(eo)] - 

On the other hand, g(t) = ( * f(r)dr has a limit as t — » oo implies that there exists 
a T(e 0 ) > 0 such that for any f 2 > fi > T(e 0 ) we have 


\g{h) -g{t 2 )\ < 


£oj(eo) 

2 


i.e., 


which for f 2 = f* + <5(eo) , fi 


f(r)di 


it i 


< 


eo^(e 0 ) 


ti contradicts (3.2.4), and, therefore, lim^oo /(f) = 0. 

□ 
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The proof of Lemma 3.2.5 follows directly from that of Lemma 3.2.6 by 
noting that the function f v (t) is uniformly continuous for any p £ [l,oo) 
because /, / £ £oo . 

The condition that fit) is uniformly continuous is crucial for the results 
of Lemma 3.2.6 to hold as demonstrated by the following example. 

Example 3.2.4 Consider the following function described by a sequence of isosceles 
triangles of base length \ and height equal to 1 centered at n where n = 1 , 2 , . . . oo 
as shown in the figure below: 



This function is continuous but not uniformly continuous. It satisfies 


lim 

t—> OO 


; i oo i 


2 z — ' n* 

n—1 


but limt^oo f{t) does not exist. 


V 


The above example also serves as a counter example to the following situ- 
ation that arises in the analysis of adaptive systems: We have a function V (t) 
with the following properties: V(t) > 0, V < 0. As shown by Lemma 3.2.3 
these properties imply that lim^oo V(t) = exists. However, there is no 
guarantee that V(t) — > 0 as t — > oo. For example consider the function 

V(t) = tt - I f{r)dT 
Jo 

where /(f) is as defined in Example 3.2.4. Clearly, 

V(t) >0, V = -fit) <0, Vf > 0 


and 9 

lim V it) = Vx = 7r — 

r— xx) 1 Z 
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but Hindoo V (t) = — ]imt-,oof(t) does not exist. According to Barbalat’s 
lemma, a sufficient condition for V (t) — > 0 as t — * oo is that V is uniformly 
continuous. 

3.2.3 Positive Definite Matrices 

A square matrix A £ IZ nxn is called symmetric if A = A T . A symmet- 
ric matrix A is called positive semidefinite if for every x £ IZ n , x T Ax > 0 
and positive definite if x T Ax > 0 Vx £ IZ n with \x\ 0. It is called neg- 

ative semidefinite ( negative definite) if —A is positive semidefinite (positive 
definite) . 

The definition of a positive definite matrix can be generalized to non- 
symmetric matrices. In this book we will always assume that the matrix 
is symmetric when we consider positive or negative definite or semidefinite 
properties. 

We write A > 0 if A is positive semidefinite, and A > 0 if A is positive 
definite. We write A > B and A > B if A — B > 0 and A — B > 0, 
respectively. 

A symmetric matrix A £ iz nxn is positive definite if and only if any one 
of the following conditions holds: 

(i) A,; (A) > 0, i = 1, 2, . . . , n where A*(A) denotes the ith eigenvalue of A, 
which is real because A = A T . 

(ii) There exists a nonsingular matrix A\ such that A = A\Aj . 

(iii) Every principal minor of A is positive. 

(iv) x T Ax > a|x| 2 for some a > 0 and Vx £ 7 Z n . 

The decomposition A = A±Aj in (ii) is unique when A\ is also symmetric. 

In this case, A\ is positive definite, it has the same eigenvectors as A, and 
its eigenvalues are equal to the square roots of the corresponding eigenvalues 
of A. We specify this unique decomposition of A by denoting A\ as As, i.e. , 
A = dsiT where A 2 is a positive definite matrix and A T / 2 denotes the 
transpose of A 1 / 2 . 

A symmetric matrix A £ , ]Z nxn has n orthogonal eigenvectors and can 
be decomposed as 

A = U T AU (3.2.5) 

where U is a unitary (orthogonal) matrix (i.e., U T U = I) with the eigen- 
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vectors of A, and A is a diagonal matrix composed of the eigenvalues of A. 
Using (3.2.5), it follows that if A > 0, then for any vector x € 1Z n 

^ min ( A)\x \ 2 < x 1 Ax < X max (A)\x\ 2 

Furthermore, if A > 0 then 


P|| 2 — X m ax(A) 

and if A > 0 we also have 


" " A • (A) 

where A maa ,(A), \ m in(A) is the maximum and minimum eigenvalue of A, 
respectively. 

We should note that if A > 0 and B > 0, then A + B > 0, but it is not 
true in general that AB > 0. 


3.3 Input/Output Stability 

The systems encountered in this book can be described by an I/O mapping 
that assigns to each input a corresponding output, or by a state variable 
representation. In this section we shall present some basic results concerning 
I/O stability. These results are based on techniques from functional analysis 
[42], and most of them can be applied to both continuous- and discrete- 
time systems. Similar results are developed in Section 3.4 by using the state 
variable approach and Lyapunov theory. 

з. 3.1 C p Stability 

We consider an LTI system described by the convolution of two functions 

и. h : 1Z + — > 7 Z defined as 

y(t) = u*h= f h(t — t)u(t)cLt = [ u(t — r)/i(r)dr (3.3.1) 

Jo Jo 

where u,y is the input and output of the system, respectively. Let H(s) be 
the Laplace transform of the I/O operator h(-). H(s) is called the transfer 
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function and h(t) the impulse response of the system (3.3.1). The system 
(3.3.1) may also be represented in the form 

y(s) = H(s)U(s) (3.3.2) 

where T(s), U(s) is the Laplace transform of y,u respectively. 

We say that the system represented by (3.3.1) or (3.3.2) is C p stable if 
u E C p => y E jC p and ||y|| p < c||«|| p for some constant c > 0 and any 
u £ C p . When p = oo, C p stability, i.e., C^ stability, is also referred to as 
bounded-input bounded-output (BIBO) stability. 

The following results hold for the system (3.3.1). 

Theorem 3.3.1 If zi G Up CLTid h G £* i then 


\\y\\ P < IHMMIp 


(3.3.3) 


where p G [1, oo] . 

When p = 2 we have a sharper bound for ||y|| p than that of (3.3.3) given by 
the following Lemma. 

Lemma 3.3.1 If u £ £2 a nd, h £ C\, then 

\\yh < sup|tf(jw)||H| 2 (3.3.4) 

UJ 

For the proofs of Theorem 3.3.1, Lemma 3.3.1 see [42]. 

Remark 3.3.1 It can be shown that (3.3.4) also holds [232] when h(-) is of 
the form 

, m / 0 t < 0 

i j I zr=omt-u) + fa(t) t> 0 

where f a £ C±, 0 \fi I <00 and L are nonnegative finite constants. 

The Laplace transform of h(t) is now given by 

OO 

H(s) = Y J fiZ~ sU +H a (s) 

i = 0 

which is not a rational function of s. The biproper transfer functions 
that are of interest in this book belong to the above class. 
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Remark 3.3.2 We should also note that (3.3.3) and (3.3.4) hold for the 
truncated functions of u,y, i.e., 

\\yt\\p < IHIilkllp 

for any t G [0,oo) provided u G C pe . Similarly, 

\\yth < sup |R(jw)|||wi|| 2 

OJ 

for any t G [0, oo) provided u G C 2e . This is clearly seen by noticing 
that u G C pe => Ut G C p for any finite t > 0. 


It can be shown [42] that inequality (3.3.3) is sharp for p = oo because 
||/i||i is the induced norm of the map T : u i— > Tu = y from C ^ into C oo, i.e., 
||T||oo = \\h\\i- Similarly for (3.3.4) it can be shown that the induced norm 
of the linear map T : C 2 e- > C 2 is given by 

||T|| 2 = sup \H(ju)\ (3.3.5) 

i.e., the bound (3.3.4) is also sharp. 

The induced C -2 norm in (3.3.5) is referred to as the norm for the 
transfer function H(s) and is denoted by 

Ikklloo = sup \H(ju)\ 

Let us consider the simple case where h(t) in (3.3.1) is the impulse response 
of an LTI system whose transfer function H(s ) is a rational function of s. 
The following theorem and corollaries hold. 

Theorem 3.3.2 Let H(s) be a strictly proper rational function of s. Then 
H(s) is analytic in Re[s] >0 if and only if h G C\. 

Corollary 3.3.1 If h G C\, then 

(i) h decays exponentially , i.e., \h(t)\ < a\e~ aot for some ai,ao > 0 

(ii) u G Ci =>• y G C\ H-Coo,?/ G C\, y is continuous and lim^oo \y(t)\ = 0 

(iii) u G C -2 =>• y G C 2 f]Coo,y £ C 2 , y is continuous and lim^oo \y(t)\ = 0 

(iv) For p G [1, 00 ], u G C p =$■ y, y G C p and y is continuous 
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For proofs of Theorem 3.3.2 and Corollary 3.3.1, see [42]. 

Corollary 3.3.2 Let H(s) be biproper and analytic in Refs] > 0. Then u £ 
£2 f| £oo and lirn^oo \u(t) \ = 0 imply that y £ £ 2 f| £ oo and limbec \y(t)\ = 
0. 

Proof H(s) may be expressed as 

H(s) =d + H a (s) 

where d is a constant and H a (s) is strictly proper and analytic in Re[s\ > 0. We 
have 

y = du + y a , y a = H a (s)u 

where, by Corollary 3.3.1, y a £ £2 f] £oo and |j/ a (f)| — * 0 as t — > 00 . Because 
u £ £2 Pi £00 and u (t) — * 0 as t — > 00 , it follows that y £ £2 f] £00 and \y(t)\ — > 0 
as t — > 00 . □ 


Example 3.3.1 Consider the system described by 

y = H(s)u, H{s)=^—— 
s + p 

for some constant a > 0. For (3 > 0, H(s ) is analytic in Refs] > 0. The impulse 
response of the system is given by 


h{t) = 


e <*) t > a 
0 t < a 


and h £ £1 if and only if j3 > 0. We have 

fO o /»oo -1 

ll^ll 1 = I h(t)\dt = J e-' i(t ~ a) dt = - 


and 

||R(s)||oo = Slip 

CO 

V 


0 ~aju> 


ju + f3 


1 

0 


Example 3.3.2 Consider the system described by 

2s + 1 


y = H(s)u, H(s ) 


s + 5 
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The impulse response of the system is given by 

/ 2S A (t)-9e~ 5t t> 0 
h ® = \ 0 t< 0 


where h a = — 9e 5t £ C\. This system belongs to the class described in Re- 
mark 3.3.1. We have 


II #0) Hoc = sup 

OJ 


1 + 2 ju ( 1 + 4 to 2 

5 + jto uF \ 25 + uj 2 


1 

2 


= 2 


Hence, according to (3.3.4) and Remarks 3.3.1 and 3.3.2, for any u £ £20 we have 


\\Vth < 2||ut|| 2 


for any f G [0, 00 ). 


V 


Definition 3.3.1 (/i— small in the mean square sense (m.s.s.)) Let 

x : [0, 00 ) 1 — > TV 1 , where x £ £, 2 e> and consider the set 


S(n) = < x : [0, 00 ) i-o- TV' 


rt+T 


x T {r)x(r)dT < coptT + ci, Vt, T > 0 


for a given constant pt > 0, where co,ci > 0 are some finite constants, and 
co is independent of pi. We say that x is pi— small in the m.s.s. if x £ S{n). 


Using the proceeding definition, we can obtain a result similar to that of 
Corollary 3.3.1 (iii) in the case where u ^ £2 but u £ S(/a) for some constant 
/x > 0. 

Corollary 3.3.3 Consider the system (3.3.1). If h £ C\, then u £ S(pt) 
implies that y £ S(p) and y £ for any finite pi > 0. Furthermore 

, 1 ~ 0 

\y{t)\ L <— T —{cofx + a), Vf>t 0 > 0 

«o (1 — e "°J 

where ao,a\ are the parameters in the bound for h in Corollary 3.3.1 (i). 
Proof Using Corollary 3.3.1 (i), we have 

\y(t)\ < f \h(t — T)u(r)\dT < f aie _ao ^ _T ^|M(r)|dr, Vf > to > 0 

Jto Jt 0 
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for some constants a±,ao > 0. Using the Schwartz inequality we obtain 
\y(t)\ 2 < a 2 j e~ a °^ T ^dT f e~ a ° ( ' t ~ T ^\u( y T)\ 2 dT 

J to J to 

2 pt 

< — / e - a °( t - T >| u(r)\ 2 dT (3.3.6) 

a 0 Jt 0 

Therefore, for any t >t 0 >0 and T > 0 we have 
rt+T 2 rt+T f T 

/ |j/(r)| 2 dr < — / / e~ a °^ T ~ s fu(s)\ 2 dsdT 

J t. OtQ J t J t 


a 

ao 


( /e- Qo(T - s) |u(s)| 2 ds+/ e -“ o(r - s) |w(s)| 2 dsV 

0 Jt \Jtn Jt J 


(3.3.7) 


Using the identity involving the change of the sequence of integration, i.e., 


pt-\-T nr pt-\-T pt-\-T 

J f(r) J g(s)dsdr = J g(s) J f(r)dTds (3.3.8) 


for the second term on the right-hand side of (3.3.7), we have 


r t+T o 2 r t+T r 

/ \y{r)\ 2 dT < — e a ° T dr / 

J t «0 J t J ti 


e aoS \ «("' 12 ' 


t(s)| 2 ds 


„2 r t + T 


Ot 0 Jt 


e a °>(*)| S 


rt+T 


e~ aoT dT ds 


2 { pt pt~\~T 

< ^ / e- a ^ t -^\u{s)\ 2 ds+ / \u{s)\ : 

a 0 \Jt 0 Jt 


ds 


where the last inequality is obtained by using e aot — e “° ( t+T ) < e aot . Because 
u £ S(fi) it follows that 



\y{ T ) \ 2 dr < 


— \ [A(t, t 0 ) + co/rT + ci] 
a o 


(3.3.9) 


where A (t,to)) = fl e a ° ^ S ^|w(s)| 2 <is. If we establish that A (t,t 0 ) < c for some 
constant c independent of t,to then we can conclude from (3.3.9) that y £ <S(/z). 
We start with 


A(i,i 0 ) 



e- ao{t - s) \u(s)\ 2 ds 
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n * ri+ l+t 0 

< e~ aot Y^ / e aoS \u(s)\ 2 ds (3.3.10) 

i = 0 "'*+*0 

n t ri+l+to 

< e - aot J2 eaoii+1+to) / |w(s)| 2 ds 

i= 0 Ji+to 

where nt is an integer that depends on t and satisfies nt + to < t < nt + 1 + to- 
Because u £ S(fi ), we have 


nt i 

A(i, t 0 ) < e-“ ot (c 0 M + ci) V e «°( i + 1 +*o) < CQ/t + Cl e a ° (3.3.11) 

' 1 — e _a ° 


i=0 


Using (3.3.11) in (3.3.9) we have 

l Wr)|2 *' - % ( c "' ,T + Ci + r^S e °”) 

for any t > to > 0. Setting Co = and Ci = (ci + it follows that 

y e 5(/x). 

From (3.3.6), (3.3.10), and (3.3.11), we can calculate the upper bound for |y(t)| 2 . 

□ 


Definition 3.3.1 may be generalized to the case where n is not necessarily 
a constant as follows. 


Definition 3.3.2 Let x : [0, oo) i— ► IZ n , w : [0,oo) i — > TZ + where x£C- 2 e , 
w £ Lie and consider the set 


S(w ) = < x, w 


rt-\-T rt-\-T 

/ x T (r)x(r)dr < co / w(r)dr + ci, Vi, T > 0 


where cq,ci > 0 are some finite constants. We say that x is w-small in the 
m.s.s. if x £ S(w). 


We employ Corollary 3.3.3, and Definitions 3.3.1 and 3.3.2 repeatedly in 
Chapters 8 and 9 for the analysis of the robustness properties of adaptive 
control systems. 


3.3.2 The C 25 Norm and I/O Stability 

The definitions and results of the previous sections are very helpful in devel- 
oping I/O stability results based on a different norm that are particularly 
useful in the analysis of adaptive schemes. 
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In this section we consider the exponentially weighted £2 norm defined 
as 

\\xths = (^J o e _5( * _r) x T (r)x(r)(irj 

where 5 > 0 is a constant. We say that x G C 25 if || 2<5 exists. When (5 = 0 
we omit it from the subscript and use the notation x G £ 2e - 

We refer to || (-) II 25 as the £ 2 <5 norm. For any finite time t, the £25 norm 
satisfies the properties of the norm given by Definition 3.2.1, i.e., 

(i) ||x t || 2 5 > 0 

(ii) 1 1 aa;* 1 1 2(5 = |cc| 1 1 a;* 1 1 2 <y for any constant scalar a 

(hi) \\{x + y)ths < \\xtbs + Wvths 

It also follows that 

(iv) 1 1 aa;* 1 1 2,5 < 1 1 37* 1 1 25 sup t |a(t)| for any a G £00 

The notion of £25 norm has been introduced mainly to simplify the sta- 
bility and robustness analysis of adaptive systems. To avoid confusion, we 
should point out that the £25 norm defined here is different from the ex- 
ponentially weighted norm used in many functional analysis books that is 

defined as j /q e l5T .T T (r)x(r)(ir| ‘ . The main difference is that this exponen- 
tially weighted norm is a nondecreasing function of t, whereas the £25 norm 
may not be. 

Let us consider the LTI system given by 

y = H(s)u (3.3.12) 

where H(s) is a rational function of s and examine £25 stability , i.e., given 
u G £ 25 , what can we say about the £ p , £ 2< 5 properties of the output y(t) 
and its upper bounds. 

Lemma 3.3.2 Let H(s) in (3.3.12) be proper. IfH(s) is analytic mRe[s] > 
—5/2 for some 5 > 0 and u G be then 

(i) 

WVths < ||tf(s)||oo<5|KI|2<5 
||#(s)||oo <5 = sup H (^jui - 0 


where 
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(ii) Furthermore, when H(s) is strictly proper, we have 

\y(t)\ < Htf(s)lblMb 

where 


l|fl(*)l| 2 S= 1 


\/27T 
T(s 




duj 


The norms ||i/(s) \\ 2 S, ll^toll 00,5 are related by the inequality 

1 




[s + p)H(s)\\ ooS 


for any p > 5/2 > 0 . 


Proof The transfer function H(s ) can be expressed as H(s) = d + H a (s ) with 


h{t) = 


0 t < 0 

d5^(t) + h a it) t> 0 


Because d is a finite constant, H(s) being analytic in Re[s] > — <5/2 implies that 
h a £ C\, i.e., the pair {H(s),h(t)} belongs to the class of functions considered in 
Remark 3.3.1. 

If we define 


{ dS A (t) + e 5 Ph a (t) 


t < 0 
t > 0 


A 


A 


ys(t) = ePy(t) and us(t) = ePu{t), it follows from (3.3.1) that 


ys(t) = / T ^h(t — T)e^ T u(r)dT = hs * us 

Jo 

Now u £ C 2 e => us £ C 2 e- Therefore, applying Lemma 3.3.1 and Remark 3.3.1 for 
the truncated signals yst , ugt at time t and noting that H(s — S /2) is the Laplace 
transform of hs we have 


\\yst h < ll#0 - <V 2 )l|oo|Ki ||2 


(3.3.13) 


Because e **\\yst || 2 = \\yths, e S P\\u S t || 2 = ||«t|| 2 «, and || ii7(s— <5/2) || oo = || -f7(s) ||oo< 5 , 
(i) follows directly from (3.3.13). 

For d = 0, i.e., H(s) is strictly proper, we have 


\y(t)\ < 


< 


pt 

/ e5 (t - T) /i(t-r)e-5 ( ‘- r) u(r)dr 

Jo 

j e s< ' t ~ T) \h(t-T)\ 2 dT^j ||ut|| 2 <5 
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where the second inequality is obtained by applying the Schwartz inequality. Then, 
\y(t)\ < ^ e s ^\h(t-r)\' 2 drj \\u t \\ 2S 

= -j= (/_°° \H(jw-5/2)\ 2 dwJ |K|| 24 (3.3.14) 

where the equality is obtained by assuming that H (s) is strictly proper and applying 
Parseval’s Theorem [42] (p. 236), implies (ii). 
because H(s) is strictly proper, we can write 

-I / nOO -i \ ^ 

ll^OOlhs = -ft=[ \{ju + p 0 )H(ju) - 5/2)\ 2 — — — -dw) 

V2t t \J-oo \ju + Po\ 2 ) 

< - 4 = ( [ sup{\{jw + p 0 )H(ju>- (5/2)1) 

V27T \7_oc |jW+Po|- ) w 

— ^2 — II ( s + Po + ^/2)^(s)||oo« 

for any p 0 > 0. Setting p 0 = p — 5/2, the result follows. □ 


Remark 3.3.3 Lemma 3.3.2 can be extended to the case where H(s) is 
not rational in s but belongs to the general class of transfer functions 
described in Remark 3.3.1. 

We refer to ||iL(s)|| 2 < 5 , ||#(s)||oo<5 defined in Lemma 3.3.2 as the 6 -shifted 
H -2 and Hoc norms, respectively. 

Lemma 3.3.3 Consider the linear time-varying system given by 

X = A(t)x + B(t)u, x(0) = X 0 /QQ-irn 

y = C T (t)x + D(t)u 

where x £ 7Z n ,y £ 7Z r ,u £ 7 Z m , and the elements of the matrices A,B,C, 
and D are bounded continuous functions of time. If the state transition 
matrix 3>(i,r) of (3.3.15) satisfies 

||<&(*, t) || < Aoe-“ 0 ^ (3.3.16) 

for some Ao, «o > 0 and u £ C, 2 e, then for any 5 £ [0, <5i) where 0 < <5i < 2ao 
is arbitrary, we have 
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(0 W)\<Jih\\ u tU+zt 

(ii) \\xths < 


cAp 


y/ (<5i — 5)(2ct 0 — <5i) 

(iii) \\yths < c 0 1 1 ttt 1 1 25 + e t 

where 


25 + e t 


Co = , C ^° sup||C lT (t)|| +sup||£>(t)||,c = sup||g(t)|| 

V(di - d)(2ao - Oi) t t t 

and et is an exponentially decaying to zero term because xq / 0. 

Proof The solution x(t) of (3.3.15) can be expressed as 

x(t) = $(t, 0)a;o + f ®(t,T)B(T)u(T)dT 
Jo 


Therefore, 

Kt)|< 11^,0)111^1 

Using (3.3.16) we have 


||<h(t,r)|| ||B(r)|| \u(r)\dT 


\x(t)\ <e t + cA 0 


=,-« o(t-r) 


u(t)\ dr 


(3.3.17) 


where c and Ao are as defined in the statement of the lemma. Expressing e a °d T 
as e _ ( Qo_ 5)( t_T ) e -2 ( t_r ) and applying the Schwartz inequality, we have 


s(t) I < U + cA 0 (j* e-^-^-^drJ (jf‘ 


r 5(t_T) |w(r)| 2 dr 


< e t 


cAn 


•\72ao — b 


\Ut II 25 


which completes the proof of (i). Using property (iii) of Definition 3.2.1 for the £25 
norm, it follows from (3.3.17) that 


||*U || 2<5 < II II 25 + cAo II || 25 


(3.3.18) 


where 


IIQ t ||25= 




p / r T \ 2 

( / e -a 0 (t— r) w ( r ) ) 

= 

/ e- 5 ^-^ / e- ao(T - s) |n(s)Ms ) dr 

VO / t 

25 

•A) Wo / 
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Using the Schwartz inequality we have 


ao(r— s 


\u(s)\di 


s = 


< 


e -(a 0 -^)(r- s ) e -^(r- S )| u ( s )| ds 


f e -(2oco-Si)(T- S ) ds f e -<5i(r-s)| u ( g )|2^ g 

Jo Jo 


< 


1 

2a 0 — 


p -<5i(r-s)| 


i(s)\ 2 ds 


i.e.. 


IIQi II 25 < 


1 


e -S(t-T ) / p -<5i (r-s) 

V2a 0 — \./o jo 


e '•v '•/ 1 e “iv “qu(g)j 2 (is(ir 


(3.3.19) 


Interchanging the sequence of integration, (3.3.19) becomes 
1 


II Qi II 2<5 < 




1 


— 5t-\-5is 


u(s)| 


>e — (<5i — <5)s g — (5i — 5)i \2 


\/2ao — (5i \J 0 1 i5i — 5 

1 / r* g-<5 (t-s) _ g-<5i(t-s) 

•\/2ao — \jq <5i — <5 


ds 


|w(s)| 2 ds 


< 


1 


\/(2q : 0 — <h)(iii — <5) WO 


— <5(i— s) 


u(s)| 2 ds 


for any £ < 5i < 2a 0 - Because || e * || 25 < e t , the proof of (ii) follows. 

The proof of (iii) follows directly by noting that 

hths < ||(C' T x) t || 2 5 + II (£>'«)( || 2,5 < ||a7* || 2 <5 sup ||C' T (t)|| + I|u t || 2 «sup||£)(t)|| 

4 4 


n 


A useful extension of Lemma 3.3.3, applicable to the case where Aft ) is 
not necessarily stable and 5 = 5q > 0 is a given fixed constant, is given by 
the following Lemma that makes use of the following definition. 

Definition 3.3.3 The pair (C(t),A(t)) in (3.3.15) is uniformly completely 

observable (UCO) if there exist constants /?i , /? 2 , ^ > 0 such that for all 
t-o > 0, 


P 2 I > N(t 0 ,t 0 + u) > Pi I 
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where N(to,to + v) = ft° +u 4> T (r, to)C(r)C T to)dr is the so-called 

observability grammian [1, 201] and <h(t,r) is the state transition matrix 
associated with A(t). 

Lemma 3.3.4 Consider a linear time-varying system of the same form as 

(3.3.15) where ( C(t),A(t . )) is UCO, and the elements ofA,B,C, and D are 
bounded continuous functions of time. For any given finite constant <5o > 0, 
we have 

(i) \x(t)\ < ^-^ (ciIKIImo + C2 1 1 3/t 1 1 2<5 0 ) + P 

(ii) ||®(i)|| 2 i 0 < ^/ ( g 1 _g 0 A ) 1 (2Ql _ (5l) ( Cl ll^ll2<5o + c 2 ||2/i||25 0 ) + ei 

(iii) \\yths 0 < ll^ll25o su Pt ll c ' T (*)ll + IM|2«5 0 su Pi \\D(t)\\ 

where c \ , c 2 > 0 are some finite constants; 6\,oti satisfy do < 5\ < 2a\, and 
et is an exponentially decaying to zero term because xq 0. 

Proof Because (C, A) is uniformly completely observable, there exists a matrix 
K(t) with bounded elements such that the state transition matrix 4> c (t, t) of A c (t) = 
A(t) — K(t)C T (t) satisfies 

l! < I> c (t, r)|| < A 1 e -ai *' t_T - ) 

for some constants a\,Si, Ai that satisfy a\ > 4^ > 4p, Ai > 0. Let us now rewrite 

(3.3.15) , by using what is called “output injection,” as 

x = (A — KC t )x + Bu + KC t x 
Because C T x = y — Du, we have 

x = A c {t)x + Bu + Ky 

where B = B — KD. Following exactly the same procedure as in the proof of 
Lemma 3.3.3, we obtain 

\x{t) | < . Al (ClIMbo + c 2 1 1 2/i 1 1 2<5 0 ) + Ct 

V 2«i — oo 

where C\ = sup t ||i?(t)||, c 2 = sup t \\K(t) || and e t is an exponentially decaying to 
zero term due to a;(0) = xq- Similarly, 

Ai 

y/Jfii ~ <^o)(2q!i — <$i) 


INK < 


(Cl||w t ||2« 0 +C 2 ||yt||25 0 ) + £ t 
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by following exactly the same steps as in the proof of Lemma 3.3.3. The proof of 
(iii) follows directly from the expression of y. □ 

Instead of the interval [0, t), the C- 2 S norm can be defined over any arbi- 
trary interval of time as follows: 

|^,ti||25= e~ S{t - T) x T (T)x(T)dT^J 

for any t{ > 0 and t > t\. This definition allow us to use the properties of 
the C -25 norm over certain intervals of time that are of interest. We develop 
some of these properties for the LTI, SISO system 

x = Ax + Bu , x(0) = xo 

y = C T x + Du (3.3.20) 

whose transfer function is given by 

y = [C T (si - A)~ l B + D\u = H(s)u (3.3.21) 

Lemma 3.3.5 Consider the LTI system (3.3.20), where A is a stable matrix 
and u G T- 2 e- Let a o,Ao be the positive constants that satisfy || e ^u~ T )|| < 
Xoe~ ao ^ t ~ T l . Then for any constant 5 £ [0, <5i) where 0 < <5i < 2«o is 
arbitrary, for any finite t\ > 0 and t>t\ we have 

(a) \x(t) | < A 0 e“ Q °( t " tl )|x(ti) | + ci\\u tjtl \\ 2 s 

(b) Wxt^hs <coe’~^ t ~ tl ' > \x(ti)\+c 2 \\ut t t 1 \\2S 

(h) ||2/t,ti||2<5 < c 3 e~^ t ~ tl '>\x(ti)\ + || -H'(s) || oo^ || || 2 5 

(iii) Furthermore if D = 0, i.e., H(s) is strictly proper, then 

\y(t)\ < c 4 e~ ao(t ~ tl ) | x (ti)| + ||i^(s)|| 2( 5||«t,i 1 ||25 

_ll R ll _ \\B\\Xq 

Cl “ 11 l|C °’ C ° “ C2 “ y/tfi-SWao-S!) 

c 3 =\\C t \\c 0 , c 4 =||C t ||Ao 


where 
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Proof Define v(t) as 


v(t) 


From (3.3.20) we have 


0 if t < t-[ 

u(t) if r > t\ 


x(t) — e A ^‘ *^a;(ti) + x(t) 

Vt > h 

(3.3.22) 

where 

pt 



x(t) = / e A ^ T) Bu(T)dT 

Jtx 

Vt > ti 


We can now rewrite x{t) as 



x(t)= f e A ^Bv{T)dT 

Jo 

Vt > 0 

(3.3.23) 

Similarly 

y{t) = C T e A ^x{t 1 ) + m 

Vt > ti 

(3.3.24) 


y(t) = f C T e AI ' t t ^Bv(t)cIt + Dv(t) Vt > 0 (3.3.25) 

Jo 

It is clear that x in (3.3.23) and y in (3.3.25) are the solutions of the system 


x = Ax + Bv, a:(0) = 0 

y = C T X + Dv (3.3.26) 

whose transfer function is C T (si — A)~ 1 B + D = H{s). 

Because A is a stable matrix, there exists constants Ao , ao > 0 such that 

|| e A(t-r)|| < Aoe -ao(t-r) 


which also implies that H(s) is analytic in Re[s] > — ao- 

Let us now apply the results of Lemma 3.3.3 to (3.3.26). We have 


kWI < 
INI25 < 


\\B HAo 

\j2ao — S 


IHk 


Cl||Ut||25 


II^IIAo 

— S)(2ao — c>i) 


V t \\25 = c 2 ||v* || 2 <5 


for some > 0, S > 0 such that 0 < <5 < <5i < 2a 0 - Because ||v t || 2<5 = ||u t , tl H 25 and 
II 25 < ||xt|| 2 ( 5 , it follows that for all t > t\ 


\x(t)\ < cxWut^hs, < c 2 ||ut,t 1 || 2 (5 (3.3.27) 

From (3.3.22) we have 

|a:(i)| < A 0 e _Qo ^ _tl) |a;(ti)| + |ar(t)| Vt > ti 
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which together with (3.3.27) imply (i)(a). Using (3.3.22) we have 

IKtj 2 « < ||(e A *' t_tl) a:(ti))t j t 1 1 | 25 + \\xt,t x II25 

which implies that 

IKtJb < (/ e “ 5(t “ r)e “ 2ao(T “ tl)dr ) 2A °l a; ( t 1 )l + ll^> t 1 ll 25 

Ane - 2 (i ) 

< % J k(U)l + \\x t , tl hs (3.3.28) 

V2a 0 — 0 

From (3.3.27) and (3.3.28), (i)(b) follows. 

Let us now apply the results of Lemma 3.3.2 to the system (3.3.26), also de- 
scribed by 

y = H(s)v 

we have 

llytlk < ||-H'(s)||ooi||Ui||2«5 

and for H (s) strictly proper 

\m\ < IkMIkIMk 

for any 0 < S < 2a 0 - Since ||i> t || 2< 5 = |K,tilk and WVtjAUs < \\yths, we have 

lift, tl Ik < IkOOlUKtJk (3-3.29) 

and 

|y(i)| < ||ff(s)||2«||ut,t 1 || 2 <5, Vt>t 1 (3.3.30) 

From (3.3.24) we have 

\y{t)\ < ||C ,T ||A 0 e _ao(t_tl) |x(U)| + |y(f)|, Vt > U (3.3.31) 

which implies, after performing some calculations, that 

lift, tl Ik < ||C T H 7 =t== e-i«-^\x(t 1 )\ + \\y tM \\ 2S , Vt > h (3.3.32) 

V 2jQl 0 — 0 

Using (3.3.29) in (3.3.32) we establish (ii) and from (3.3.30) and (3.3.31) we 
establish (iii) . □ 

By taking t\ = 0, Lemma 3.3.5 also shows the effect of the initial condi- 
tion x(0) = .To of the system (3.3.20) on the bounds for \y(t)\ and 1 1 j/t 1 1 2 < 5 - 
We can obtain a similar result as in Lemma 3.3.4 over the interval [t\,t\ 
by extending Lemma 3.3.5 to the case where A is not necessarily a stable 
matrix and 5 = > 0 is a given fixed constant, provided (C, A) is an 

observable pair. 
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Lemma 3.3.6 Consider the LTV system (3.3.15) where the elements of 
Aft), B(t), C(t), and D(t ) are bounded continuous functions of time and 
whose state transition matrix $(t,r) satisfies 

mt,r)\\ < X 0 e- a ^ 

\/t > t and t,r £ [ti , ^ 2 ) for some t 2 > ti > 0 and cto, Ao > 0. Then for any 
5 £ [0, <5i) where 0 < <5i < 2ao is arbitrary, we have 

(i) \x{t)\ < A 0 e- QO ( t - tl )|x(ti)| + V 2 a°_ s \\ u t, tl h 6 

(ii) Kftlh s < ^fa e - f( ‘-‘ 1> l^ 1 )l + v / ( ,,_g l L,_, l) »”,,„ll 2 ^,Vt e [ti ,t 2 ) 

where c = sup t ||i?(t)||. 

Proof The solution x(f) of (3.3.15) is given by 

x(t) = <f>(t, ti)x(ti) + f <Ht,T)B(r)u(r)dT 
Jt 1 

Hence, 

|z(t)| < A 0 e- a °( t -* 1 )|x(ti)| + cA 0 [ e -«o(t-r)| u ( T )| dT 

■It 1 

Proceeding as in the proof of Lemma 3.3.3 we establish (i). Now 

IKtJk < A 0 |x(H)| ( [* + cX 0 \\Q t , 


;,ti||25 


lift, til 


A 

28 = 


o -a 0 (t-T) 


u(r)\dT 


28 


't 1 / t,ti | 

Following exactly the same step as in the proof of Lemma 3.3.3 we establish that 

1 


llft.ti ||25 < 


\/(2ao — <^i) (f5i — <5) 


|Wt, tl ||25 


Because 


Aokft)J_ e _4(t_ t i) 

•\/2q;o — 5 


IKtilb< ^^AP=e-^-^ + \\Q tM hs 


the proof of (ii) follows. 


□ 


Example 3.3.3 (i) Consider the system described by 


y = H(s)u 
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where H(s) = ^ 3 . We have 


|-ffO)||oo<5 = SUP 


jw + 3 - 


and 


^ (/_ 


, ,2 I (6— <5) : 
— 00 uj -\ - A — 


6-5’ 


-duj = 


VS G [0, 6 ) 


\/6 — S 


, We [0,6) 


For u(t) = 1 ,Vt > 0, we have y(t) = |(1 — e 3t ), which we can use to verify 
inequality (ii) of Lemma 3.3.2, i.e., 


\y(t)\ 


tv 1 1 ^ e 


—3* I 


< 


\/6 — S 


(l- e~ st 


1 



holds Vt G [0, 00 ) and S G (0, 6 ). 

(ii) The system in (i) may also be expressed as 


y = -3 y + 2 u, y( 0 ) = 0 


Its transition matrix <I>(t, 0) = e 34 and from Lemma 3.3.3, we have 
\y(t)\ < -f==\\uths, VS G [0, 6 ) 

For u(t) = l,Vt > 0, the above inequality implies 


\y(t)\ 


|1 — e 


—3* I 


< 


\/6 — (5 


1 — e 


—St 


which holds for all 5 G (0,6). 


V 


3.3.3 Small Gain Theorem 

Many feedback systems, including adaptive control systems, can be put in 
the form shown in Figure 3.1. The operators H \ , H 2 act on ei,e 2 to pro- 
duce the outputs yi,y 2 ] u\,U 2 are external inputs. Sufficient conditions for 
Hi , H 2 to guarantee existence and uniqueness of solutions e\,y\,e 2 ,V 2 for 
given inputs U\,U 2 in C pe are discussed in [42], Here we assume that H \ , H 2 
are such that the existence and uniqueness of solutions are guaranteed. The 
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Figure 3.1 Feedback system. 


problem is to determine conditions on Hi,H2 so that if U\,U2 are bounded 
in some sense, then ei,e2,yi,V2 are also bounded in the same sense. 

Let £ be a norrned linear space defined by 

C= {f:K + ^K n | Il/H < oo} 


where || • |j corresponds to any of the norms introduced earlier. Let C e be 
the extended norrned space associated with C, i.e., 


C e = {f:K + ^K n \\\f t \\ <oo,Vt eU + } 


where 


ft(r) 


f(r) t < t 
0 T > t 


The following theorem known as the small gain theorem [ 42 ] gives sufficient 
conditions under which bounded inputs produce bounded outputs in the 
feedback system of Figure 3 . 1 . 


Theorem 3.3.3 Consider the system shown in Figure 3.1. Suppose H\, 
H- 2 '. C e •-» £ e ; ei,e2 G C e . Suppose that for some constants 71, 72 > 0 and 
Pi, fh, the operators Hi,H -2 satisfy 


Vt G K+. If 


||(Hiei)t|| < 71 1| eit || + Pi 
|| (FT 2 e 2 )t || < 72 1 1 e 2 * 1 1 + P 2 

7172 < 1 


then 
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(i) ||eit|| < (I-7172) 1 (H'unll + 72 ||M 2 t|| + P2 + 72 A) 

||e 2 *|| < ( 1 -7i72) _1 (||^2t|| +7i||-uu|| +/ 3 i +7 i/?2) ( 3 . 3 . 33 ) 

for any t > 0 . 

(ii) If in addition, ||«i ||, ||t*2 1 | <00, then e\, 2/1 , 2/2 have finite norms, and 

the norms of e i,e2 are bounded by the right-hand sides of ( 3 . 3 . 33 ) with 
all subscripts t dropped. 

The constants 71, 72 are referred to as the gains of H\, H2 respectively. When 
U2 = 0 , there is no need to separate the gain of H\ and H2. In this case, one 
can consider the “loop gain ” H2H1 as illustrated by the following corollary: 

Corollary 3 . 3.4 Consider the system of Figure 3.1 with U2 = 0 . Suppose 
that 

\\{H 2 H 1 e{)t\\ < 721 1 | en || + P21 
||(#iei)t|| < 7i||eu|| + ( 3 i 

Vt £ U + for some constants 721,71 > 0 and /?2i , / 5 i - If 721 < 1 , then 

(i) 

||eit|| < (1 - 72i) _1 (ll^nll +/?2i) 

llyitll < 7 i(! ~ 72 i)^ 1 (\\uit\\ +/?2i) + Pi ( 3 . 3 . 34 ) 

for any t > 0 . 

(ii) If in addition ||ui|| < 00, then ei, e2, yi, 2/2 have finite norms and 
( 3 . 3 . 34 ) holds without the subscript t. 

The proofs of Theorem 3 . 3.3 and Corollary 3 . 3.4 follow by using the prop- 
erties of the norm [ 42 ] . 

The small gain theorem is a very general theorem that applies to both 
continuous and discrete-time systems with multiple inputs and outputs. 

As we mentioned earlier, Theorem 3 . 3.3 and Corollary 3 . 3.4 assume the 
existence of solutions ei,e2 £ C e . In practice, u\ , U2 are given external 
inputs and ei,e2 are calculated using the operators H\ , H2 ■ Therefore, the 
existence of e\, e2 £ C e depends on the properties of H\,H2. 


Example 3 . 3.4 Let us consider the feedback system 
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where G(s) = ^ ^ , a > 0 is a constant time delay, and K is a constant feedback 
gain. The external input r is an input command, and d is a noise disturbance. We 
are interested in finding conditions on the gain K such that 

(i) r,d€ Coo => ei,e 2 ,y € Coo 

(ii) r,de £ 2 => e lt e 2 , y C C 2 

The system is in the form of the general feedback system given in Figure 3.1, i.e. , 


u\ = r,u 2 = d 


H\ei(t) = f e 2a e 2 ^ 4 r ^ei (r)dr 

J o 

H 2 e 2 (t) = Ke 2 (t) 

where e _2 b~“) for t > a comes from the impulse response g(t ) of G(s), i.e., g(t) = 
e -2 (t-a) £ or ^ > a anc [ = 0 for t < a. 

(i) Because 

\H 1 e 1 (t)\ < e 2a [ e _2(t_r) |ei(r)|dT 

Jo 

< e 2a [ e~ 2it ~ T) dT\\e lt \\oo 

Jo 

— 2 1 1 eif 1 1 00 

we have 71 = Similarly, the Hoc-gain of H 2 is j 2 = \K\. Therefore, for Hoc- 
stability the small gain theorem requires 

\K\ , , 

< 1 , i-e., \K\ < 2 

(ii) From Lemma 3.3.1, we have 


\\{Hiei) t \\ 2 < sup 

CJ 


e -<X]U 

2 + ju> 


lleulb 


1 

2 


|| ei* || 2 


which implies that the C 2 gain of Hi is 71 = \. Similarly the C 2 gain of H 2 is 
72 = \K\, and the condition for H2-stability is \K\ < 2. 
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For this simple system, however, with a. = 0, we can verify that r,d € Coo =>• 
ei,e2 ,y £ Coo if and only if K > —2, which indicates that the condition given by 
the small gain theorem (for a = 0) is conservative. V 

Example 3.3.5 Consider the system 

x = A c x, A c = A + B 

where x £ lZ n , A is a stable matrix, i.e., all the eigenvalues of A are in Re[s] < 0 
and B is a constant matrix. We are interested in obtaining an upper bound for B 
such that A c is a stable matrix. Let us represent the system in the form of Figure 
3.1 as the following: 


Hi 



h 2 


We can verify that the Coo gain of H\ is 7i = ^ where ai, cto > 0 are the constants 
in the bound ||e" 4 d- r )|| < aie _ “°h _T ) that follows from the stability of A. The Coo 
gain of H 2 is 72 = ||.B||. Therefore for Coo stability, we should have 


ll*|| < ^ 

CX 1 

Now Coo stability implies that A c = A + B is a stable matrix. (Note that the initial 
condition for x is taken to be zero, i.e., x(0) = 0.) V 


Despite its conservatism, the small gain theorem is widely used to design 
robust controllers for uncertain systems. In many applications, certain loop 
transformations are needed to transform a given feedback system to the form 
of the feedback system of Figure 3.1 where H \ , H 2 have finite gains [42]. 
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3.3.4 Bellman-Gronwall Lemma 

A key lemma for analysis of adaptive control schemes is the following. 

Lemma 3.3.7 (Bellman-Gronwall Lemma I) [232] Let X(t),g(t),k(t) 
be nonnegative piecewise continuous functions of time t. If the function y(t) 
satisfies the inequality 

y(t ) < A (t) + g{t ) f k(s)y(s)ds , Vf > to > 0 (3.3.35) 

Jtn 


y(t) < X (t) + g(t) J A (s)k(s) exp (^J k(T)g(T)d.T^j d-s Vt > to > 0 

(3.3.36) 

In particular, if X(t) = A is a constant and g(t) = 1, then 
y(t ) < A exp k(s)ds' S j Vf > to > 0 


Proof Let us define 


q(t)^k(t)e-f'o 9iT)k{T)dT 


Because k(t) is nonnegative, we have q(t) > 0 Vf > to- Multiplying both sides of 
(3.3.35) by q(t), and rearranging the inequality we obtain 


q(t)y(t) - q(t)g(t ) / k(s)y(s)ds < A (t)q(t) 


(3.3.37) 


From the expression of q{t), one can verify that 


d f - f* g(r)k(T)dT 

= — I e Jt o 


k(s)y(s)ds I (3.3.38) 


q(t)y(t) - q(t)g(t) / k(s)y(s)ds 


Using (3.3.38) in (3.3.37) and integrating both sides of (3.3.37), we obtain 


s ft 0 > I l f < f \ (s)q(s)ds 

Jtn Jtn 


Therefore, 


f k(s)y(s)ds < e h 0 9 ^ k ^ dT f X(s)q(s)ds 

Jto Jto 

= Ji 9(T)k{T)dT f X{s)k{s)e~L 

Jto 

= [‘ X (s)k(s)efs 9{T)k{T)dT ds 


-I g(r)k(r)dT 

Jt o ds 


(3.3.39) 
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Using (3.3.39) in (3.3.35), the proof for the inequality (3.3.36) is complete. 
Consider the special case where A is a constant and g = 1. Define 


From (3.3.35), we have 


Now 


Because k > 0, we have 


A ft 

qi = A + / k(s)y(s)ds 
Jt 0 

y(t) < <n{t) 

= ky 
qi < kqi 


Let w = qi — kq±. Clearly, w < 0 and 

qi = kqi + w 

which implies 

P k(r)dT 

= e Jt o t 


qi{t) = e J 




J k(s)ds 


w(t)(1t 


(3.3.40) 


Because k > 0,w < 0 Vt > to and Qi(to) — it follows from (3.3.40) that 


y(t) < qi(t) < Xe^o HT)dT 


and the proof is complete. □ 

The reader can refer to [32, 232] for alternative proofs of the B-G Lemma. 
Other useful forms of the B-G lemma are given by Lemmas 3.3.8 and 3.3.9. 

Lemma 3.3.8 (B-G Lemma II) Let A (t),k(t) be nonnegative piecewise 
continuous function of time t and let A (t) be differentiable. If the function 
y(t) satisfies the inequality 

y(t ) < A (t) + f k(s)y(s)ds, Vt > to > 0 

Jto 

y(t) < X(to)e^o k ^ s ^ ds 4. I A(s)e-^ k ^ dT ds, Vt > to > 0. 

Jt 0 


then 
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Proof Let 

z(t) = A (t) + / k(s)y(s)ds 
Jt o 

it follows that z is differentiable and z > y. We have 

i = A + ky, z(t 0 ) = A (t 0 ) 

Let v = z — y, then 


z = A + kz — kv 


whose state transition matrix is 


$(t,r)=exp J k(s)ds 


Therefore, 


Because 


z(t) = $(t,to)z(t 0 ) + / ^(t,r)[A(r) - k(r)v(r)]dr 

Jto 


$(t, T)fc(r)u(r)dr > 0 


h 0 


resulting from <3>(t, r), k(r), v(t) being nonnegative, we have 

z(t) <®(t,t 0 )z(t 0 ) + I < L(Lr)A(r)dr 
Jt 0 

Using the expression for d>(t,to) in the above inequality, we have 

y(t) < z(t) < A(t 0 )e^' t o ^ + f A (s)e^ k( ' T ^ dT ds 

Jt 0 

and the proof is complete. 


□ 


Lemma 3.3.9 (B-G Lemma III) Let cq, c\, C2, a be nonnegative constants 
and k(t) a nonnegative piecewise continuous function of time. Ify(t ) satisfies 
the inequality 

y(t ) < c 0 e _Q ^ o) + ci + c 2 [ e~ a{t ^k(T)y(T)dT, Vi > t 0 

Jtn 


then 


y(t) < (co + Ci)e to )g C2 ft 0 k ( s ) ds _|_ c ^ a f e a(t r) e c 2 f T k(s)ds^ T ^ \/t > t() 

Jtn 
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Proof The proof follows directly from Lemma 3.3.8 by rewriting the given inequal- 
ity of y as 

y{t) < A (t) + f k{r)y{r)dT 
Jt 0 

where y(t) = e at y(t),k(t ) = C2k(t),\(t) = coe“ 4 ° + Cie Qt . Applying Lemma 3.3.8, 
we obtain 

at ^ t i \ atn 02 f, k(s)ds f ar c 2 f k(s)ds , 

e y(t) < (co + Ci)e °e Jt o +cia / e e dr 

Jt 0 


The result follows by multiplying each side of the above inequality by e 


□ 


The B-G Lemma allows us to obtain an explicit bound for y(t) from the 
implicit bound of y(t) given by the integral inequality (3.3.35). Notice that 
if y(t) > 0 and A (t) = 0 Vi > 0, (3.3.36) implies that y(t) = 0 Vi > 0. 

In many cases, the B-G Lemma may be used in place of the small gain 
theorem to analyze a class of feedback systems in the form of Figure 3.1 as 
illustrated by the following example. 

Example 3.3.6 Consider the same system as in Example 3.3.5. We have 
x(t) = e At x(0) + f e A{t ~ T) Bx(T)dT 

Jo 

Hence, 

|x(f)|<aie-“ ot K0)|+ f a 1 e _Q °^ _r1 ||13|||a;(r)|dr 

Jo 

i.e., 

|x(t)| < aie _ “ ot |a;(0)| + aie _ao *||H| f e a ° r |a;(T)|dr 

Jo 

Applying the B-G Lemma I with A = aie _ “ ot |a;(0)|, g{t) = ai\\B\\e~ aot , k(t) = e“ ot , 
we have 

|x(f)| < aie _ “ ot |a;(0)| + ai|a;(0)|e _7t 

where 

7 = a 0 - Q'i||H|| 

Therefore, for |x(t)| to be bounded from above by a decaying exponential (which 
implies that A c = A + B is a stable matrix), B has to satisfy 
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which is the same condition we obtained in Example 3.3.5 using the small gain 
theorem. In this case, we assume that |x(0)| yt 0, otherwise for x(0) = 0 we would 
have A (t) = 0 and |x(f)| = 0 Vt > 0 which tells us nothing about the stability of 
A c . The reader may like to verify the same result using B-G Lemmas II and III. V 


3.4 Lyapunov Stability 

3.4.1 Definition of Stability 

We consider systems described by ordinary differential equations of the form 

x = f(t,x), x(t 0 ) = x o (3.4.1) 

where x G H n , f : JxB(r) J = [to, oo) and B(r) = {x£lZ n | |x| < r}. 
We assume that / is of such nature that for every xq G B(r) and every 
to £ 7Z + , (3.4.1) possesses one and only one solution x(t; to, xq). 

Definition 3.4.1 A state x e is said to be an equilibrium state of the 

system described by (3.4-1) if 


f(t,x e ) = 0 for all t > to 

Definition 3.4.2 An equilibrium state x e is called an isolated equilib- 
rium state if there exists a constant r > 0 such that B(x e ,r) = {x \ 
\x — x e \ < r} C IZ n contains no equilibrium state of (3.4-1) other than 

X e - 

The equilibrium state x\ e = 0, X 2 e = 0 of 

X\ = X\X 2 , X'2 = x\ 


is not isolated because any point x\ = 0, x '2 = constant is an equilibrium 
state. The differential equation 

x = (x — l) 2 x 

has two isolated equilibrium states x e = 1 and x e = 0. 
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Definition 3.4.3 The equilibrium state x e is said to be stable (in the 
sense of Lyapunov) if for arbitrary to and e > 0 there exists a 5(e,to) 
such that |xo — x e \ < 6 implies \x(t] to, xq) — x e \ < e for all t > to- 

Definition 3.4.4 The equilibrium state x e is said to be uniformly stable 
(u.s.) if it is stable and if 5 {e, to) in Definition 3-4-3 does not depend on to- 

Definition 3.4.5 The equilibrium state x e is said to be asymptotically 
stable (a.s.) if (i) it is stable, and (ii) there exists a 5(to) such that |a:o — 
x e \ < 5(to) implies lim^oo \x(t;to,xo) — x e \ = 0. 

Definition 3.4.6 The set of all xq G 7 Z n such that x(t;to,xo) —> x e as 
t — > oo for some to > 0 is called the region of attraction of the equilibrium 
state x e . If condition (ii) of Definition 3-4-5 is satisfied, then the equilibrium 
state x e is said to be attractive. 

Definition 3.4.7 The equilibrium state x e is said to be uniformly asymp- 
totically stable (u.a.s.) if (i) it is uniformly stable, (ii) for every e > 0 and 
any t-o G 1Z + , there exist a 6 o > 0 independent of to and e and a T(e) > 0 
independent of to such that \x(t] to, .To) — x e \ < e for all t. > t-o + T(e ) 
whenever \xo — x e \ < 6 q- 

Definition 3.4.8 The equilibrium state x e is exponentially stable (e.s.) 

if there exists an a > 0 , and for every e > 0 there exists a 6(e) >0 such 
that 

\x(t;to,xo) — x e \ < for all t > to 

whenever |xo — x e \ < 6(e). 

Definition 3.4.9 The equilibrium state x e is said to be unstable if it is 
not stable. 

When (3.4.1) possesses a unique solution for each xq G VJ' and to G 1Z + , 
we need the following definitions for the global characterization of solutions. 

Definition 3.4.10 A solution x(t;to,xo ) of (3-4-1) is bounded if there 
exists a (3 > 0 such that |a:(t; to, xo)| < /3 for all t > to, where (3 may depend 
on each solution. 
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Definition 3.4.11 The solutions of (3. 4-1) are uniformly bounded (u.h.) 
if for any a > 0 and to £ hl + , there exists a (3 = /3(a) independent of to 
such that if |xo| < a, then |x(t;to,xo)| < (3 for all t > to- 

Definition 3.4.12 The solutions of (S.f.l) are uniformly ultimately bo- 
unded (u.u.b.) (with bound B) if there exists a B > 0 and if corresponding 
to any a > 0 and to £ 1Z + , there exists aT = T(a) > 0 (independent of to) 
such that |xo| < a implies |x(t; to, xo)| < B for all t>to + T. 

Definition 3.4.13 The equilibrium point x e of (S.f.l) is asymptotically 
stable in the large (a.s. in the large) if it is stable and every solution 
of (3.4-1) tends to x e as t —> 00 (i.e., the region of attraction of x e is all of 
IZ n ). 

Definition 3.4.14 The equilibrium point x e of (3. 4-1) is uniformly asymp- 
totically stable in the large (u.a.s. in the large) if (i) it is uniformly 
stable, (ii) the solutions of (3. 4-1) are uniformly bounded, and (in) for any 
a > 0, any e > 0 and to £ H + , there exists T(e,a) > 0 independent of to 
such that if |xo — x e \ < a then |x(t; t-o, xo) — x e \ < e for all t > to + T(e, a). 

Definition 3.4.15 The equilibrium point x e of (3. 4-1) is exponentially 
stable in the large (e.s. in the large) if there exists a > 0 and for any 

(3 > 0, there exists k((3 ) > 0 such that 

\x{t] to, xo)| < k{(3)e~ a<yt ~ to ^ for all t > to 
whenever |xo| < (3. 

Definition 3.4.16 If x(t;to,xo) is a solution of x = f(t,x), then the tra- 
jectory x(t;to,xo ) is said to be stable (u.s., a.s., u.a.s., e.s., unstable) 
if the equilibrium point z e = 0 of the differential equation 

z = f(t , ^ + x(t; t 0 , x 0 )) - /(t, x(t; t 0 , x 0 )) 

is stable (u.s., a.s., u.a.s., e.s., unstable, respectively). 

The above stability concepts and definitions are illustrated by the fol- 
lowing example: 
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Example 3.4.1 

(i) x = 0 has the equilibrium state x e = c, where c is any constant, which is not 
an isolated equilibrium state. It can be easily verified that x e = c is stable, 
u.s. but not a.s. 

(ii) x = — x 3 has an isolated equilibrium state x e = 0. Its solution is given by 


x(t) = x(t; t 0 ,x 0 ) 


1 + 2 xl(t - t 0 ) 


Now given any e > 0, |xq| < <5 = e implies that 


|a(t)l = V l + 24 (t-t 0 ) * ^ <^t>t o >0 (3.4.2) 

Hence, according to Definition 3.4.3, x e = 0 is stable. Because 6 = e is 
independent of t 0 , x e = 0 is also u.s. Furthermore, because x e = 0 is stable 
and x{t) — » x e = 0 as t — > oo for all xo € 7 Z, we have a.s. in the large. Let 
us now check whether x e = 0 is u.a.s. in the large by using Definition 3.4.14. 
We have already shown u.s. From (3.4.2) we have that x(t) is u.b. To satisfy 
condition (iii) of Definition 3.4.14, we need to find a T > 0 independent of to 
such that for any a > 0 and e > 0, |xo| < a implies |x(t)| < e for all t > to + T. 
From (3.4.2) we have 


|x(*)l ^ |x(*o +T)\ = \I TT2tfT < \/t > to + T 

Choosing T = 2 )-, , it follows that |x(t) | < e Vt > t 0 + T. Hence, x e = 0 is u.a.s. 
in the large. Using Definition 3.4.15, we can conclude that x e = 0 is not e.s. 

(iii) x = (x — 2)x has two isolated equilibrium states x e = 0 and x e = 2. It can be 
shown that x e = 0 is e.s. with the region of attraction R a = {x \ x < 2} and 
x e = 2 is unstable. 

(iv) x = — yryX has an equilibrium state x e = 0 that is stable, u.s., a.s. in the large 
but is not u.a.s. (verify). 

(v) x = (t sin t — cos t — 2)x has an isolated equilibrium state x e = 0 that is stable, 

a.s. in the large but not u.s. (verify). V 


3.4.2 Lyapunov’s Direct Method 

The stability properties of the equilibrium state or solution of (3.4.1) can 
be studied by using the so-called direct method of Lyapunov (also known 
as Lyapunov’s second method) [124, 125]. The objective of this method is 
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to answer questions of stability by using the form of f(t,x ) in (3.4.1) rather 
than the explicit knowledge of the solutions. We start with the following 
definitions [143]. 

Definition 3.4.17 A continuous function p : [0, r] i— > R + (or a continuous 
function p : [0, oo) i— > IZ + ) is said to belong to class 1C, i.e., p £ 1C if 

(i) </?(0) = 0 

(ii) p is strictly increasing on [0, r] (or on [0,oo)j. 

Definition 3.4.18 A continuous function p : [0, oo) i— > R + is said to belong 
to class 1C1Z, i.e., <p G ICR if 

(i) ¥7(0) = 0 

(ii) p is strictly increasing on [0, oo) 

(iii) Hindoo p(r) = oo. 

2 

The function ¥>(|cc|) = belongs to class 1C defined on [0, oo) but not to 
class ICR. The function (^(M) = \ x \ belongs to class 1C and class ICR. It is 
clear that <p £ ICR implies p £ 1C, but not the other way. 

Definition 3.4.19 Two functions p\,p 2 G £ defined on [0,r] (or on[ 0, oo),) 
are said to be of the same order of magnitude if there exist positive 
constants k\ .k 2 such that 

ki<pi(i ’i) < <P 2 (n) < k 2 <pi(ri), Vn G [0,r] ( or Vri G [0,oo)) 

2 2 

The function (/?i(|a;|) = an d T 2 = I+T 1 are the same order of 

magnitude (verify!). 

Definition 3.4.20 A function V(t,x) : R + x B(r) h- > R with V(t,0) = 
0 Vt G R + is positive definite if there exists a continuous function p G 1C 
such that V[t,x) > ¥>(|a;|) Vt G R + , x G B(r) and some r > 0. V(t,x) is 
called negative definite if —V(t,x) is positive definite. 

2 

The function V(t,x) = with x G B( 1) is positive definite, whereas 

V(t, x) = ppiX 2 is not. The function V{t,x) = is positive definite for 
all x G R. 
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Definition 3.4.21 A function V(t,x) : IZ + x B(r) ^ 1Z with V(t, 0) = 
0 Vf G 1Z + is said to be positive (negative) semidefinite ifV(t,x) > 0 
(V(t,x) < 0) for all t G IZ + and x G B(r) for some r > 0. 

Definition 3.4.22 A function V(t,x) : IZ + x B(r) h- >■ 1 Z with V(t,0) = 
0 \/t G 1Z + is said to be decrescent if there exists tp € K. such that \ V(t, x) \ < 
<^(|x|) Vi > 0 and Vx G B(r) for some r > 0. 

The function V(t,x) = j + t x 2 is decrescent because V(t,x) = 1 + t x 2 < 
x 2 Vi G 1Z + but V(t,x ) = tx 2 is not. 

Definition 3.4.23 A function V (t, x) : IZ + x lZ n > IZ with V (t, 0) = 0 Vi G 
1Z + is said to be radially unbounded if there exists p G KHZ such that 
V{t,x) > </?(|x|) for all x G 7 Z n and t G 1Z + . 

2 

The function V(x) = satisfies conditions (i) and (ii) of Definition 3.4.23 

(i.e., choose </?(|x|) = ypjyp)- However, because V(x) < 1, one cannot find a 
function </?(|x|) G KHZ to satisfy V(x) > <^(|x|) for all x G IZ n . Hence, V is 
not radially unbounded. 

It is clear from Definition 3.4.23 that if V(t,x) is radially unbounded, 
it is also positive definite for all x G 7 Z n but the converse is not true. The 
reader should be aware that in some textbooks “positive definite” is used for 
radially unbounded functions, and “locally positive definite” is used for our 
definition of positive definite functions. 

Let us assume (without loss of generality) that x e = 0 is an equilibrium 
point of (3.4.1) and define V to be the time derivative of the function V(t, x) 
along the solution of (3.4.1), i.e., 

V= ^ + (VL) T/ (t,x) (3.4.3) 

where VV = [J^-, J^, . . . , ^~] T is the gradient of V with respect to x. The 
second method of Lyapunov is summarized by the following theorem. 

Theorem 3.4.1 Suppose there exists a positive definite function V(t,x) : 
7Z + xB(r) i— > 7 Z for some r > 0 with continuous first- order partial derivatives 
with respect to x, t, and V ( t , 0) = 0 Vi G 7Z + . Then the following statements 
are true: 
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(i) //’ V < 0, then x e = 0 is stable. 

(ii) If V is decrescent and V < 0, then x e = 0 is u.s. 

(iii) If V is decrescent and V < 0, then x e is u.a.s. 

(iv) If V is decrescent and there exist <p\,<P 2 i<P 3 G JC of the same order of 
magnitude such that 

<MM) < V(t,x) < <P 2 {\x\), V(t,x) < -^ 3 (M) 

for all x G B(r) and t G IZ + , then x e = 0 is e.s. 

In the above theorem, the state x is restricted to be inside the ball B(r) 
for some r > 0. Therefore, the results (i) to (iv) of Theorem 3.4.1 are 
referred to as local results. Statement (iii) is equivalent to that there exist 
<PU<P 21 <P 3 G /C, where <p\,<P 2 i<Pz do not have to be of the same order of 
magnitude, such that </?i(|x|) < V(t,x) < <p 2 (\x\),V(t,x) < — <£> 3 (|x|). 

Theorem 3.4.2 Assume that (3-4.1) possesses unique solutions for all xo G 
IZ n . Suppose there exists a positive definite, decrescent and radially un- 
bounded function V(t,x) : 7 Z + xIZ n i— > IZ + with continuous first-order partial 
derivatives with respect to t,x and V(t,0) = 0 Vt G 1Z + . Then the following 
statements are true: 

(i) If V <0, then x e = 0 is u.a.s. in the large. 

(ii) If there exist tpi,<p 2 i<P 3 £ /C IZ of the same order of magnitude such that 

<MW) < v(t,x) < <p 2 (\x I), V(t,x) < -(^ 3 (kl) 

then x e = 0 is e.s. in the large. 

Statement (i) of Theorem 3.4.2 is also equivalent to that there exist 
<Pi,<P 2 G K, and tps G KHZ such that 

^i(kl) < V(t,x) < <P 2 {\x\), V(t,x) < — V7 3 (|cc|), Vx G 7 Z n 

For a proof of Theorem 3.4.1, 3.4.2, the reader is referred to [32, 78, 79, 
97, 124], 

Theorem 3.4.3 Assume that (3. 4-1) possesses unique solutions for all xq G 
7 Z n . If there exists a function V(t,x) defined on |x| > R (where R may 
be large) and t G [0, oo) with continuous first-order partial derivatives with 
respect to x, t and if there exist <pi,ip 2 £ KHZ such that 
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(i) ¥>i(|®|) < V(t,x) < <p 2 (\x\) 

(ii) V(t, x) < 0 

for all |a:| > R and t G [0, oo), then, the solutions of (3. f.l) are u.b. If in 
addition there exists ps G K. defined on [0, oo) and 

(iii) V(t,x) < — y> 3 (|x|) for all |a:| > R and t G [0,oo) 
then, the solutions of (3.4-1) are u.u.b. 

Let us examine statement (ii) of Theorem 3.4.1 where V decrescent and 
V < 0 imply x e = 0 is u.s. If we remove the restriction of V being decrescent 
in (ii), we obtain statement (i), i.e., V < 0 implies x e = 0 is stable but not 
necessarily u.s. Therefore, one might tempted to expect that by removing 
the condition of V being decrescent in statement (iii), we obtain x e = 0 is 
a.s., i.e., V < 0 alone implies x e = 0 is a.s. This intuitive conclusion is 
not true, as demonstrated by a counter example in [206] where a first-order 
differential equation and a positive definite, nondecrescent function V(t,x) 
are used to show that V < 0 does not imply a.s. 

The system (3.4.1) is referred to as nonautonomous. When the function 
/ in (3.4.1) does not depend explicitly on time t, the system is referred to 
as autonomous. In this case, we write 

x = f(x) (3.4.4) 

Theorem 3.4.1 to 3.4.3 also hold for (3.4.4) because it is a special case of 
(3.4.1). In the case of (3.4.4), however, V(t,x) = V(x), i.e., it does not 
depend explicitly on time t, and all references to the word “decrescent” and 
“uniform” could be deleted. This is because V(x) is always decrescent and 
the stability (respectively a.s.) of the equilibrium x e = 0 of (3.4.4) implies 
u.s. (respectively u.a.s.). 

For the system (3.4.4), we can obtain a stronger result than Theo- 
rem 3.4.2 for a.s. as indicated below. 

Definition 3.4.24 A set id in 7 Z n is invariant with respect to equation 
(3.4-4) if every solution of (3-4-4) starting in id remains in fl for all t. 

Theorem 3.4.4 Assume that (3-4-4) possesses unique solutions for all xq G 
IZ n . Suppose there exists a positive definite and radially unbounded function 
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V(x) : IZ n i— ► IZ + with continuous first-order derivative with respect to x and 
F(0) = 0. If 

(i) v < o Vx € n n 

(ii) The origin x = 0 is the only invariant subset of the set 

n = {x g n n \v = o} 

then the equilibrium x e = 0 of (3.4-4) a - s ■ the large. 

Theorems 3.4.1 to 3.4.4 are referred to as Lyapunov-type theorems. The 
function V (f , x) or V (x) that satisfies any Lyapunov-type theorem is referred 
to as Lyapunov function. 

Lyapunov functions can be also used to predict the instability properties 
of the equilibrium state x e . Several instability theorems based on the second 
method of Lyapunov are given in [232], 

The following examples demonstrate the use of Lyapunov’s direct method 
to analyze the stability of nonlinear systems. 

Example 3.4.2 Consider the system 

xi = x 2 + cxi(xj + xl) , . 

X 2 = — Xi + CX 2 (xf + x\) 

where c is a constant. Note that x e = 0 is the only equilibrium state. Let us choose 

V (x) = x i + x 2 

as a candidate for a Lyapunov function. V(x) is positive definite, decrescent, and 
radially unbounded. Its time derivative along the solution of (3.4.5) is 

V = 2c{xi+xlf (3.4.6) 

If c = 0, then V = 0, and, therefore, x e = 0 is u.s.. If c < 0, then V = — 2|c|(x^+x|) 2 

is negative definite, and, therefore, x e = 0 is u.a.s. in the large. If c > 0, x e = 0 is 

unstable (because in this case V is strictly increasing Vi > 0), and, therefore, the 
solutions of (3.4.5) are unbounded [232]. V 
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Example 3.4.3 

pendulum 


Consider the following system describing the motion of a simple 


X\ = x 2 
x 2 = — fcsinaq 


(3.4.7) 


where fc > 0 is a constant, X\ is the angle, and x 2 the angular velocity. We consider a 
candidate for a Lyapunov function, the function V ( x ) representing the total energy 
of the pendulum given as the sum of the kinetic and potential energy, i.e., 


1 f Xl 1 

V (x) = -X2 + k sin 77 dr] = -x\ + k( 1 — cos £ 1 ) 

2 " Jo 2 

V(x) is positive definite and decrescent \/x € B(n) but not radially unbounded. 
Along the solution of (3.4.7) we have 

V = 0 


Therefore, the equilibrium state x e = 0 is u.s. 


V 


Example 3.4.4 Consider the system 


Xl = x 2 

x 2 = —x 2 — e“*a;i 


(3.4.8) 


Let us choose the positive definite, decrescent, and radially unbounded function 

V (. x ) = x\ + x\ 

as a Lyapunov candidate. We have 

V = —2x\ + 2 xix 2 (1 — e -t ) 

Because for this choice of V function neither of the preceding Lyapunov theorems 
is applicable, we can reach no conclusion. So let us choose another V function 

V(t,x) = x\ + e t X 2 


In this case, we obtain 

V(t,x) = —e t x\ 


This V function is positive definite, and V is negative semidefinite. Therefore, 
Theorem 3.4.1 is applicable, and we conclude that the equilibrium state x e = 0 
is stable. However, because V is not decrescent, we cannot conclude that the 
equilibrium state x e = 0 is u.s. V 
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Example 3.4.5 Let us consider the following differential equations that arise quite 
often in the analysis of adaptive systems 


X = — X + 4>X 



(3.4.9) 


The equilibrium state is x e = 0, 4> e = c, where c is any constant, and, therefore, the 
equilibrium state is not isolated. 

Let us define <j> = <j> — c, so that (3.4.9) is transformed into 


X = —(1 — C)X + </>X 



(3.4.10) 


We are interested in the stability of the equilibrium point x e = 0, (j) e = c of (3.4.9), 
which is equivalent to the stability of x e = 0,</> e = 0 of (3.4.10). We choose the 
positive definite, decrescent, radially unbounded function 

VW) = y + f (3-4.11) 


Then, 


V(x,4>) = — (1 — c)x 2 


If c > 1, then V > 0 for x ^ 0; therefore, x e = 0, <j> e = 0 is unstable. If, however, 
c < 1, then x e = 0, <j> e = 0 is u.s. For c < 1 we have 


V (x, </>) = —cqX 2 < 0 


(3.4.12) 


where Co = 1 — c > 0. From Theorem 3.4.3 we can also conclude that the solutions 
x(t),<j>(i) are u.b. but nothing more. We can exploit the properties of V and V, 
however, and conclude that x(t) — > 0 as t — > oo as follows. 

From (3.4.11) and (3.4.12) we conclude that because V(t) = V(x(t), <f>{t)) is 
bounded from below and is nonincreasing with time, it has a limit, i.e., lim t _ >00 V ( t ) = 
Vao. Now from (3.4.12) we have 


lim 


x 2 dr = 


x 2 dr = 


V(0) - V K 


< oo 


> 00 Jo Jo c 0 

i.e., x G C 2 - Because the solution x(t),cf>(t) is u.b., it follows from (3.4.10) 
x € Coo, which together with x G C 2 imply (see Lemma 3.2.5) that x(t) — > 
t — > oo. 


that 
0 as 
V 


Example 3.4.6 Consider the differential equation 

±i = — 2a: i + x\x 2 + x 2 
x 2 = — x\ — Xi 
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2 2 

Consider V{x) = ^ We have V = —2x\ < 0 and the equilibrium x le = 

0,a:2e = 0 is u.s. The set defined in Theorem 3.4.4 is given by 

Cl = {x ly x 2 | Xi = 0} 

Because X\ = x 2 on Cl, any solution that starts from Cl with x 2 ^ 0 leaves Cl. 
Hence, X\ = 0,:r 2 = 0 is the only invariant subset of Cl. Therefore the equilibrium 
X\ e = 0, x 2e = 0 is a.s. in the large. V 


In the proceeding examples, we assume implicitly that the differential 
equations considered have unique solutions. As indicated in Section 3.4.1, 
this property is assumed for the general differential equation (3.4.1) on which 
all definitions and theorems are based. The following example illustrates that 
if the property of existence of solution is overlooked, an erroneous stability 
result may be obtained when some of the Lyapunov theorems of this section 
are used. 


Example 3.4.7 Consider the second-order differential equation 

Xi = —2x\ - a; 2 sgn(xi), aq(0) = 1 
x 2 = \x\\, x 2 (0) = 0 


where 


sgn(aii) = 


1 

-1 


if Xi > 0 
if X\ < 0 


The function 


V(xi,x 2 ) =xj + : 


has a time derivative V along the solution of the differential equation that satisfies 


V(xi,x 2 ) = —Ix\ < 0 

Hence, according to Theorem 3.4.1, x±,x 2 €E Hoo and the equilibrium X\ e = 0, x 2e = 
0 is u.s. Furthermore, we can show that Xi(t) — > 0 as t — > oo as follows: Because 
V > 0 and V < 0 we have that lim^oo V(xi(t),x 2 (t)) = Vx, for some Vx G 1Z + . 
Hence, 

Ax\{t)cLt = V (xi(0), a: 2 (0)) ~V{x x (t),x 2 (t)) 

which implies that 

/»oo 

4 / x\(t)(It < oo 

Jo 
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i.e., xi £ Ci- From x\,x 2 € £00 we have that i\ £ which together with 
X\ £ £2 and Lemma 3.2.5 imply that x\(t) — * 0 as t — > 00 . 

The above conclusions are true provided continuous functions Xi(t),X 2 (t) with 
Xi(0) = 1 and # 2 ( 0 ) = 0 satisfying the differential equation for all t £ [0,oo) exist. 
However, the solution of the above differential equation exists only in the time 
interval t £ [0, 1], where it is given by Xi(t) = (l — t)e~ t ,X2{t) = te~*. The difficulty 
in continuing the solution beyond t = 1 originates from the fact that in a small 
neighborhood of the point (aq(l), # 2 ( 1 )) = (0,e _1 ), = — 2 x\ — a? 2 Sgn(xi) < 0 if 

x\ > 0 and x\ > 0 if x\ < 0. This causes X\ to change sign infinitely many times 
around the point (Cfe^ 1 ), which implies that no continuous functions xi(t),X2(t) 
exist to satisfy the given differential equation past the point t = 1. V 

The main drawback of the Lyapunov’s direct method is that, in general, 
there is no procedure for finding the appropriate Lyapunov function that 
satisfies the conditions of Theorems 3.4.1 to 3.4.4 except in the case where 
(3.4.1) represents a LTI system. If, however, the equilibrium state x e = 0 of 
(3.4.1) is u.a.s. the existence of a Lyapunov function is assured as shown in 
[139], 

3.4.3 Lyapunov-Like Functions 

The choice of an appropriate Lyapunov function to establish stability by 
using Theorems 3.4.1 to 3.4.4 in the analysis of a large class of adaptive 
control schemes may not be obvious or possible in many cases. However, a 
function that resembles a Lyapunov function, but does not possess all the 
properties that are needed to apply Theorems 3.4.1 to 3.4.4, can be used to 
establish some properties of adaptive systems that are related to stability 
and boundedness. We refer to such a function as the Lyapunov-like function. 
The following example illustrates the use of Lyapunov-like functions. 

Example 3.4.8 Consider the third-order differential equation 

X\ = -Xl - X2X3, £ 1 ( 0 ) =£10 

±2 = xia; 3 , £2(0) = £20 ( 3 . 4 . 13 ) 

£3 = x\ , x 3 (0) = x 30 

which has the nonisolated equilibrium points in P 3 defined by X\ = 0, £2 = constant, 
X 3 = 0 or Xi = 0, X 2 = 0,x 3 = constant. We would like to analyze the stability 
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properties of the solutions of (3.4.13) by using an appropriate Lyapunov function 
and applying Theorems 3.4.1 to 3.4.4. If we follow Theorems 3.4.1 to 3.4.4, then we 
should start with a function V{xi,X 2 ,x$) that is positive definite in R 3 . Instead of 
doing so let us consider the simple quadratic function 


V{xi,x 2 ) 



x 


2 

2 


2 


which is positive semidefinite in R 3 and, therefore, does not satisfy the positive 
definite condition in R 3 of Theorems 3.4.1 to 3.4.4. The time derivative of V along 
the solution of the differential equation (3.4.13) satisfies 

V=-x\<Q (3.4.14) 


which implies that V is a nonincreasing function of time. Therefore, 
V( Xl (t),x 2 (t)) < V( Xl (0),x 2 (0)) = Vo 


and V,x 1 ,x 2 G Coo- Furthermore, V has a limit as t — > oo, i.e., 


lim V(x 1 (t),x 2 (t)) = Voc 


and (3.4.14) implies that 


and 


[ x\{r)d,T = Vo — V(t), Vi > 0 

Jo 


x\(t)cLt = V 0 -V 00 <oo 


i.e., Xi € C 2 . From X\ € C 2 we have from (3.3.13) that x$ £ C oo and from 
x\,x 2 ,xz G Coo that x\ G Coo ■ Using X\ G Coo, X\ G C 2 and applying Lemma 3.2.5 
we have X\(t) — » 0 as t — > oo. By using the properties of the positive semidefinite 
function V(xi,x 2 ), we have established that the solution of (3.4.13) is uniformly 
bounded and x\(t) — > 0 as t — > oo for any finite initial condition xi(0), x 2 (fi), 
2:3(0). Because the approach we follow resembles the Lyapunov function approach, 
we are motivated to refer to V {x\,x 2 ) as the Lyapunov-like function. In the above 
analysis we also assumed that (3.4.13) has a unique solution. For discussion 
and analysis on existence and uniqueness of solutions of (3.4.13) the reader 
is referred to [191]. V 


We use Lyapunov-like functions and similar arguments as in the example 
above to analyze the stability of a wide class of adaptive schemes considered 
throughout this book. 
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3.4.4 Lyapunov’s Indirect Method 

Under certain conditions, conclusions can be drawn about the stability of the 
equilibrium of a nonlinear system by studying the behavior of a certain linear 
system obtained by linearizing (3.4.1) around its equilibrium state. This 
method is known as the first method of Lyapunov or as Lyapunov’s indirect 
method and is given as follows [32, 232] : Let x e = 0 be an equilibrium state 
of (3.4.1) and assume that f(t,x) is continuously differentiable with respect 
to x for each t > 0. Then in the neighborhood of x e = 0, / has a Taylor 
series expansion that can be written as 


x = f(t, x) = A(t)x + fi(t, x) (3.4.15) 


where Aft) = V/ | x= o is referred to as the Jacobian matrix of / evaluated 
at x = 0 and f\(t, x) represents the remaining terms in the series expansion. 


Theorem 3.4.5 Assume that Aft ) is uniformly bounded and that 


lim sup 

M->o t > o 


|/i(f, a?) | 
\x\ 


= 0 


Let z e = 0 be the equilibrium of 


zft ) = Aft) z ft) 

The following statements are true for the equilibrium x e = 0 of (3.4-15): 

(i) If z e = 0 is u.a.s. then x e = 0 is u.a.s. 

(ii) If z e = 0 is unstable then x e = 0 is unstable 

(iii) If z e = 0 is u.s. or stable, no conclusions can be drawn about the 
stability of x e = 0. 

For a proof of Theorem 3.4.5 see [232]. 


Example 3.4.9 Consider the second-order differential equation 

mx = —2 y{x 2 — l)x — kx 

where m , y, and k are positive constants, which is known as the Van der Pol os- 
cillator. It describes the motion of a mass-spring-damper with damping coefficient 



120 


CHAPTER 3. STABILITY 


2fj,(x 2 — 1) and spring constant k, where x is the position of the mass. If we define 
the states X\ = x, X 2 = x, we obtain the equation 


±i 

X 2 


X 2 

k 2/i 2 

XI lx 1 — 1)X 2 

m m 


which has an equilibrium at X\ e = 0,x 2e = 0. The linearization of this system 
around (0, 0) gives us 

Zl 

. 

Because ?n,/i > 0 at least one of the eigenvalues of the matrix A is positive and 
therefore the equilibrium (0, 0) is unstable. V 



0 

l 


Zl 


_k_ 

m 

2M 

m _ 


Z 2 


3.4.5 Stability of Linear Systems 

Equation (3.4.15) indicates that certain classes of nonlinear systems may be 
approximated by linear ones in the neighborhood of an equilibrium point or, 
as often called in practice, operating point. For this reason we are interested 
in studying the stability of linear systems of the form 

x{t ) = A(t)x(t) (3.4.16) 

where the elements of A(t) are piecewise continuous for all t > to > 0, as a 
special class of the nonlinear system (3.4.1) or as an approximation of the 
linearized system (3.4.15). The solution of (3.4.16) is given by [95] 

x(t;t 0 ,x 0 ) = 4>(t,f 0 )a4) 

for all t, > to, where 4>(f,fo) is the state transition matrix and satisfies the 
matrix differential equation 

J^$(Mo) = A(t)$>(t,t 0 ), Vt>t 0 
to) = I 

Some additional useful properties of 4>(f , to) are 

(i) 4>(f,fo) = 4>(t, t)4>(t, to) Vt > r > to (semigroup property) 

(ii) 4>(t, t 0 ) _1 = <h(to, t) 

(iii) J^$(Mo) = -4>(t,toM(to) 
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Necessary and sufficient conditions for the stability of the equilibrium state 
x e = 0 of (3.4.16) are given by the following theorems. 

Theorem 3.4.6 Let ||4>(t, r)|| denote the induced matrix norm o/4>(t, r) at 
each time t > r. The equilibrium state x e = 0 of (3.4-16) is 

(i) stable if and only if the solutions of (3-4.16) are bounded or equivalently 

c(t 0 ) = sup ||<f>(Mo)|| < oo 
t>t 0 

(ii) u.s. if and only if 

Co = sup c(ffi) = sup ( sup ||$(f,fo)|| ) < oo 
to>0 to>0 J 

(iii) a.s. if and only if 

lirn ||$(Mo)|| = 0 

t — >OC 

for any to E IZ + 

(iv) u.a.s. if and only if there exist positive constants a and (3 such that 

||$(Mo)|| <ae~W- t0 \ \/t > t 0 > 0 

(v) e.s. if and only if it is u.a.s. 

(vi) a.s., u.a.s., e.s. in the large if and only if it is a.s., u.a.s., e.s., respec- 
tively. 

Theorem 3.4.7 [1] Assume that the elements of A(t) are u.b. for all t E 
1Z + . The equilibrium state x e = 0 of the linear system (3.4-16) is u.a.s. if 
and only if given any positive definite matrix Q(t), which is continuous in 
t, and satisfies 

0 < c\I < Q(t ) < C 2 I < 00 
for all t > to, the scalar function defined by 

roo 

V(t, x) = x T J <h T (r, t)Q('r) < l>(r, t) dr x (3.4.17) 

exists (i.e., the integral defined by (3.4-17) is finite for finite values of x and 
t) and is a Lyapunov function of (3.4-16) with 

V(t,x) = — x T Q(t)x 
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It follows using the properties of <h(t, to) that P(t) = / t °°$ T (r, t)Q(r) < I ) (r, t)dr 
satisfies the equation 

P(t) = —Q(t) - A T (t)P(t) - P(t)A(t) (3.4.18) 

i.e. , the Lyapunov function (3.4.17) can be rewritten as V(t,x) = x T P(t)x , 
where P(t) = P T (t) satisfies (3.4.18). 

Theorem 3.4.8 A necessary and sufficient condition for the u.a.s of the 
equilibrium x e = 0 of (3. 4-16) is that there exists a symmetric matrix P(t) 
such that 


111 < P(t) < 72 1 

P(t) + A T (t)P{t) + P(t)A(t) + vC(t)C T (t ) < O 

are satisfied Vt > 0 and some constant u > 0, where 71 > 0, 72 > 0 are con- 
stants and C (f) is such that (C (t) , A(t)) is a UCO pair (see Definition 3.3.3). 

When A(t) = A is a constant matrix, the conditions for stability of the 
equilibrium x e = 0 of 

x = Ax (3.4.19) 

are given by the following theorem. 

Theorem 3.4.9 The equilibrium state x e = 0 of (3-4.19) is stable if and 
only if 

(i) All the eigenvalues of A have nonpositive real parts. 

(ii) For each eigenvalue A* with Re{Aj} = 0, A,; is a simple zero of the 
minimal polynomial of A (i.e., of the monic polynomial tf(X) of least 
degree such that ip (A) = O). 

Theorem 3.4.10 A necessary and sufficient condition for x e = 0 to be a.s. 
in the large is that any one of the following conditions is satisfied 1 : 

(i) All the eigenvalues of A have negative real parts 

J Note that (iii) includes (ii). Because (ii) is used very often in this book, we list it 
separately for easy reference. 
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(ii) For every positive definite matrix Q, the following Lyapunov matrix 
equation 

A t P + PA = -Q 

has a unique solution P that is also positive definite. 

(iii) For any given matrix C with ( C , A) observable, the equation 

A t P + PA = -C T C 

has a unique solution P that is positive definite. 

It is easy to verify that for the LTI system given by (3.4.19), if x e = 0 is 
stable, it is also u.s. If x e = 0 is a.s., it is also u.a.s. and e.s. in the large. 

In the rest of the book we will abuse the notation and call the matrix 
A in (3.4.19) stable when the equilibrium x e = 0 is a.s., i.e., when all the 
eigenvalues of A have negative real parts and marginally stable when x e = 0 
is stable, i.e., A satisfies (i) and (ii) of Theorem 3.4.9. 

Let us consider again the linear time- varying system (3.4.16) and suppose 
that for each fixed t all the eigenvalues of the matrix Aft) have negative real 
parts. In view of Theorem 3.4.10, one may ask whether this condition for 
Aft ) can ensure some form of stability for the equilibrium x e = 0 of (3.4.16). 
The answer is unfortunately no in general, as demonstrated by the following 
example given in [232]. 


Example 3.4.10 Let 


m 


— 1 + 1.5 cos 2 t 1 — 1.5 sin t cost 
— 1 — 1.5 sin t cost — 1 + 1.5 sin 2 t 


The eigenvalues of Aft) for each fixed t, 

X (A(t)) = -.25 ± j.5v / L75 


have negative real parts and are also independent of t. Despite this the equilibrium 
x e = 0 of (3.4.16) is unstable because 

e 5t cos t e _t sin t 
$(t, 0) = 5 1 . . - t 

\ / — e-°'sm t e cost 


is unbounded w.r.t. time t. 


V 
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Despite Example 3.4.10, Theorem 3.4.10 may be used to obtain some 
sufficient conditions for a class of Aft), which guarantee that x e = 0 of 
(3.4.16) is u.a.s. as indicated by the following theorem. 

Theorem 3.4.11 Let the elements of A(t) in (3-4.16) be differentiable 2 and 
bounded functions of time and assume that 

(Al) Re{Xi(A(t))} < —a s Vf > 0 and for i = 1,2, ... ,n where a s > 0 is 
some constant. 

(i) If ||A|| £ C, 2 , then the equilibrium state x e = 0 of ( 3-4-16 ) is u.a.s. in 

the large. 

(ii) If any one of the following conditions: 

(a) fl +T \\A(T)\\dT < fiT + a 0 , i.e., (||A||)3 £ S(n) 

(b) fl +T \\A{T)\\ 2 dr < n 2 T + otQ, i.e., \\A\\ £ S{fi 2 ) 

(c) \\A(t) ||</i 

is satisfied for some «o, l J £ 'R + and Vt > 0, T > 0, then there exists a 
H* > 0 such that if n £ [0,/i*), the equilibrium state x e of ( 3-4-16 ) is 
u.a.s. in the large. 

Proof Using (Al), it follows from Theorem 3.4.10 that the Lyapunov equation 

A r (t)P(t) + P(t)A(t) = -I (3.4.20) 

has a unique bounded solution P(t) for each fixed t. We consider the following 
Lyapunov function: 

V (t, x) = x 1 P(t)x 

Then along the solution of (3.4.16) we have 

V = — |x(t)| 2 + x 1 {t)P{t)x{t) (3.4.21) 

From (3.4.20), P satisfies 

A T (t)P{t) + P(t)A(t) = —Q(t), Vt> 0 (3.4.22) 

where Q(t ) = A T (t)P(t) + P(t)A(t). Because of (Al), it can be verified [95] that 

nOO 

P(t)= / e AT(t)T Q{t)e A(t)T dT 
Jo 

2 The condition of differentiability can be relaxed to Lipschitz continuity. 
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satisfies (3.4.22) for each t > 0, therefore, 

/»oo 

\\Ht)\\ < IIQWII / l|e^ T(t) l||e^||dr 

JO 

Because (Al) implies that ||e A ^A|| < a 1 e _ “ oT for some ai,ao > 0 it follows that 


rail <ciiQ(*)ii 


for some c > 0. Then, 

||Q(t)||<2||P(t)||||i(t)|| 
together with P € £oo imply that 

\\P(t)\\ <P\\A(t)\\, Vt > 0 (3.4.23) 

for some constant (3 > 0. Using (3.4.23) in (3.4.21) and noting that P satisfies 
0 < (3\ < Knin ( P) < A max{P ) < /?2 for some /3 i,/? 2 > 0, we have that 

m < -\x(t)\ 2 + (3\\A(t)\\\x(t)\ 2 < -(3i l V{t) + (3^\\A{t)\\V{t) 


therefore, 




Let us prove (ii) first. Using condition (a) in (3.4.24) we have 


(3.4.24) 


V(t) < e~ 1_/3/3 i V)(*-‘»)eW 1 lao V(t 0 ) 

Therefore, for /i* = ^ and V/i € [0,/i*),V ( t ) — > 0 exponentially fast, which implies 
that x e = 0 is u.a.s. in the large. 

To be able to use (b), we rewrite (3.4.24) as 


V(t) < l,A(T)l,dT V(t 0 ) 


Using the Schwartz inequality and (b) we have 



\\M T )\\ dr 


< 


< 

< 


(yJt ||i(r)|| 2 dr^ Vt-t-o 

[/u 2 (t - t 0 ) 2 + a 0 (t - t 0 )] 2 
/i(t - t 0 ) + -v foo\/t - t 0 


Therefore, 


V(t) < e~ ol A~ t °)y(t)V{t 0 ) 
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where a = (1 — 'i)j3 2 1 — 00i V> 


y(t) 


exp [— 7/3 2 1 (t - t 0 ) + 00i 1 v / “oV / ^ r ^o] 

, \ 2 

00i y/ao \ 


exp 


-7/3 2 1 Vt - to ^ 


27/£ 


oq0 2 02 

470? 


and 7 is an arbitrary constant that satisfies 0 < 7 < 1. It can be shown that 


y(t) < exp 


«0 


0 2 02 

4701 


A 


= c Vt > <0 


hence, 

V(t) < ce- a( ‘- to V(t 0 ) 

Choosing /z* = ^ 1 ^ 7 - ) , we have that V/x € [0, /x*), a > 0 and, therefore, V(t) — » 0 
exponentially fast, which implies that x e = 0 is u.a.s in the large. 

Since (c) implies (a), the proof of (c) follows directly from that of (a). 

The proof of (i) follows from that of (ii) (b), because ||A|| G £ 2 implies (b) with 
H = 0 . □ 


In simple words, Theorem 3.4.11 states that if the eigenvalues of A(t) 
for each fixed time t have negative real parts and if A(t) varies sufficiently 
slowly most of the time, then the equilibrium state x e = 0 of (3.4.16) is u.a.s. 


3.5 Positive Real Functions and Stability 

3.5.1 Positive Real and Strictly Positive Real Transfer Func- 
tions 

The concept of PR and SPR transfer functions plays an important role in the 
stability analysis of a large class of nonlinear systems, which also includes 
adaptive systems. 

The definition of PR and SPR transfer functions is derived from network 
theory. That is, a PR (SPR) rational transfer function can be realized as 
the driving point impedance of a passive (dissipative) network. Conversely, a 
passive (dissipative) network has a driving point impedance that is rational 
and PR (SPR). A passive network is one that does not generate energy, i.e. , a 
network consisting only of resistors, capacitors, and inductors. A dissipative 
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network dissipates energy, which implies that it is made up of resistors and 
capacitors, inductors that are connected in parallel with resistors. 

In [177, 204], the following equivalent definitions have been given for PR 
transfer functions by an appeal to network theory. 

Definition 3.5.1 A rational function G(s) of the complex variable s = a + 
juo is called PR if 

(i) G(s) is real for real -s. 

(ii) Re[G(s)] > 0 for all Refs] > 0. 

Lemma 3.5.1 A rational proper transfer function G(s) is PR if and only if 

(i) G(s) is real for real s. 

(ii) G(s) is analytic in Re[s] > 0, and the poles on the jco-axis are simple 
and such that the associated residues are real and positive. 

(iii) For all real value ui for which s = jco is not a pole of G(s), one has 
Re[G(jcu)] > 0. 

For SPR transfer functions we have the following Definition. 

Definition 3.5.2 [177] Assume that G(s) is not identically zero for all s. 
Then G(s) is SPR if G(s — e) is PR for some e > 0. 

The following theorem gives necessary and sufficient conditions in the fre- 
quency domain for a transfer function to be SPR: 

Theorem 3.5.1 [89] Assume that a rational function G(s) of the complex 
variable s = a + ju is real for real s and is not identically zero for all s. Let 
n* be the relative degree of G(s) = Z(s)/R(s) with |n*| < 1. Then, G(s) is 
SPR if and only if 

(i) G(s) is analytic in Refs] > 0 

(ii) Re[G(jcu)] > 0, Vcu £ (— 00 , 00 ) 

(iii) (a) When n* = 1, lim^i^^ cu 2 Re[G(jcu)] > 0. 

(b) When n* = -1, lim^^ > 0. 
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It should be noted that when n* = 0, (i) and (ii) in Theorem 3.5.1 are 
necessary and sufficient for G(s) to be SPR. This, however, is not true for 
n* = 1 or — 1. For example, 

G(s) = (s T ol T P)/\{s + cr)(s + (T)\ 

a, (3 > 0 satisfies (i) and (ii) of Theorem 3.5.1, but is not SPR because it 
does not satisfy (iiia). It is, however, PR. 

Some useful properties of SPR functions are given by the following corol- 
lary. 

Corollary 3.5.1 (i) G(s) is PR (SPR) if and only ifl/G(s) is PR (SPR) 

(ii) If G(s) is SPR, then, |n*| < 1, and the zeros and poles of G(s) lie in 
Re[s] < 0 

(iii) If \n*\ > 1, then G{s ) is not PR. 


A necessary condition for G(s) to be PR is that the Nyquist plot of G(jto) 
lies in the right half complex plane, which implies that the phase shift in the 
output of a system with transfer function G(s) in response to a sinusoidal 
input is less than 90°. 


Example 3.5.1 Consider the following transfer functions: 

s-1 .... „ , . 1 


(i) Gi(s) — 
(iii) G 3 (s) = 


(s + 2) 2 ’ 
s H- 3 

(s + l)(s + 2) 


(ii) G 2 (s) = 
(iv) G 4 (s) = 


{s + 2) 2 


a 


Using Corollary 3.5.1 we conclude that Gi(s) is not PR, because 1/Gi(s) is not 
PR. We also have that G 2 (s) is not PR because n* > 1. 

For G 3 (s) we have that 


Re[G 3 (jw)] = (g _ ^ 2 + 9 ^ 2 >0, Vue (-oo, oo) 

which together with the stability of G 3 (s) implies that G 3 (s) is PR. Because G 3 (s) 
violates (iii) (a) of Theorem 3.5.1, it is not SPR. 

The function G 4 (s) is SPR for any a > 0 and PR, but not SPR for a = 0. V 


The relationship between PR, SPR transfer functions, and Lyapunov sta- 
bility of corresponding dynamic systems leads to the development of several 
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stability criteria for feedback systems with LTI and nonlinear parts. These 
criteria include the celebrated Popov’s criterion and its variations [192]. The 
vital link between PR, SPR transfer functions or matrices and the existence 
of a Lyapunov function for establishing stability is given by the following 
lemmas. 

Lemma 3.5.2 (Kalman- Yakub ovich-Popov (KYP) Lemma) [7, 192] 

Given a square matrix A with all eigenvalues in the closed left half complex 
plane, a vector B such that ( A , B ) is controllable, a vector C and a scalar 
d > 0, the transfer function defined by 

G(s) = d + C T (sI - A)~ 1 B 

is PR if and only if there exist a symmetric positive definite matrix P and 
a vector q such that 

A T P + PA = - qq r 
PB-C = ±(V2d)q 

Lemma 3.5.3 (Lefschetz-Kalman- Yakubovich (LKY) Lemma) [89, 
126] Given a stable matrix A, a vector B such that ( A , B) is controllable, a 
vector C and a scalar d > 0, the transfer function defined by 

G(s) = d + C T (sI - A)~ 1 B 

is SPR if and only if for any positive definite matrix L, there exist a sym- 
metric positive definite matrix P , a scalar v > 0 and a vector q such that 


A T P + PA= -qq T - vL 


PB-C = ±qV2d 

The lemmas above are applicable to LTI systems that are controllable. This 
controllability assumption is relaxed in [142, 172]. 

Lemma 3.5.4 (Meyer-Kalman-Yakubovich (MKY) Lemma) Given 
a stable matrix A, vectors B , C and a scalar d > 0, we have the following: 

If 


G(s) = d + C T (sI - A)~ l B 
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is SPR, then for any given L = L T > 0, there exists a scalar v > 0, a vector 
q and a P = P T > 0 such that 

A t P + PA = -qq J - vL 

PB-C = ±qV2d 

In many applications of SPR concepts to adaptive systems, the transfer 
function G(s) involves stable zero-pole cancellations, which implies that the 
system associated with the triple ( A , B , C) is uncontrollable or unobservable. 
In these situations the MKY lemma is the appropriate lemma to use. 


Example 3.5.2 Consider the system 

y = G{s)u 

where G(s ) = . We would like to verify whether G(s) is PR or SPR by 

using the above lemmas. The system has the state space representation 

x = Ax + Bu 
y = C T x 

where 

A = 

According to the above 

PB = C 


0 1 
—2 -3 


B = 


C = 


lemmas, if G(s) is PR, we have 


which implies that 


P= Pi 3 

[ 3 1 

for some pi > 0 that should satisfy pi — 9 > 0 for P to be positive definite. We 
need to calculate pi , v > 0 and q such that 

A T P + PA = —qq T — vL (3.5.1) 


is satisfied for any L = L T > 0. We have 


12 11 -pi 

11 -Pi 0 


= —qq — vL 


Now Q = ^ 
for pi 11. 


12 
11 ~Pi 


11 - Pi 
0 


is positive semidefinite for pi = 11 but is indefinite 

i X - J 

because no pi > 9 exists for which Q is positive definite, no v > 0 can 



3.5. POSITIVE REAL FUNCTIONS AND STABILITY 


131 


be found to satisfy (3.5.1) for any given L = L T > 0. Therefore, G(s ) is not SPR, 
something we have already verified in Example 3.5.1. For p\ = 11, we can select 
v = 0 and q = [VT2, 0] T to satisfy (3.5.1). Therefore, G(s) is PR. V 


The KYP and MKY Lemmas are useful in choosing appropriate Lya- 
punov or Lyapunov-like functions to analyze the stability properties of a 
wide class of continuous-time adaptive schemes for LTI plants. We illustrate 
the use of MKY Lemma in adaptive systems by considering the stability 
properties of the system 


e = A c e + B c 6 t oj 

0 = -Teiu (3.5.2) 

ei = C T e 

that often arises in the analysis of a certain class of adaptive schemes. In 

(3.5.2) , T = r T > 0 is constant, e £ 7 Z m ,6 £ lZ n ,e\ £ IZ 1 and oj = Cj e + 
Cj e rn , where e m is continuous and e m £ Coo. The stability properties of 

(3.5.2) are given by the following theorem. 

Theorem 3.5.2 If A c is a stable matrix and G(s) = C T (si — A C )~ 1 B C is 
SPR, then e, 8, ui £ Coo', e, 9 £ Coo D £2 and e(t), e\(t), 9(t) — > 0 as t — > oo. 

Proof Because we made no assumptions about the controllability or observability 
of (A c , B c ,C c ), we concentrate on the use of the MKY Lemma. We choose the 
function 

V(e,0) = e T Pe + 0 J T~ 1 6 

where P = P T > 0 satisfies 


AjP + PA C = - qq T - vL 
PB c = C 

for some vector q , scalar v > 0 and L = L T > 0 guaranteed by the SPR property 
of G(s) and the MKY Lemma. Because u> = Gq e + Cj e rn and e m £ C ^ can be 
treated as an arbitrary continuous function of time in Loo, we can establish that 
V(e, 9) is positive definite in TZ n+m , and V is a Lyapunov function candidate. 

The time derivative of V along the solution of (3.5.2) is given by 

V = —e T qq T e — ve 1 Le < 0 
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which by Theorem 3.4.1 implies that the equilibrium e e , 9 e = 0 is u.s. and e±, e, 9 £ 
Coo ■ Because w = Cq e + Cj e m and e m £ Coo, we also have w £ Coo- By exploit- 
ing the properties of V, V further, we can obtain additional properties about the 
solution of (3.5.2) as follows: From V > 0 and V < 0 we have that 

lim V(e(t),0(t)) = V oo 

t—> OO 

and, therefore, 

/♦OO 

v / e T Ledr < Vo — Voo 

Jo 

where Vo = U(e(0), 0(0)). Because v\ m in{L)\e\ 2 < ve 1 Le, it follows that e £ C 2 . 
From (3.5.2) we have |0(t)| < ||r|||ei||u;|. Since u £ Coo and ei € Coo 0^2 it follows 
from Lemma 3.2.3 (ii) that 9 G Using e,0,io £ Coo we obtain e € Coo, 

which together with e € C 2 , implies that e{t) — > 0 as t — > 00 . Hence, e\, 9, e — > 0 as 
t — > 00 , and the proof is complete. □ 

The arguments we use to prove Theorem 3.5.2 are standard in adaptive 
systems and will be repeated in subsequent chapters. 

3.5.2 PR and SPR Transfer Function Matrices 

The concept of PR transfer functions can be extended to PR transfer function 
matrices as follows. 

Definition 3.5.3 [7, 172] Annxn matrix G(s) whose elements are rational 
functions of the complex variable s is called PR if 

(i) G(s) has elements that are analytic for Re[s] > 0 

(ii) G*(s) = G(s*) for Re[s] > 0 

(iii) G t (s*) + G(s) is positive semidefinite for Re[s] > 0 
where * denotes complex conjugation. 

Definition 3.5.4 [7] An n x n matrix G(s) is SPR if G(s — e) is PR for 
some e > 0. 

Necessary and sufficient conditions in the frequency domain for G(s) to be 
SPR are given by the following theorem. 
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Theorem 3.5.3 [215] Consider the n x n rational transfer matrix 

G(s) = C T (si — A)~ 1 B + D (3.5.3) 

where A, B, C, and D are real matrices with appropriate dimensions. As- 
sume that G(s)-\-G T (— s) has rank n almost everywhere in the complex plane. 
Then G(s) is SPR if and only if 

(i) all elements of G(s) are analytic in Re[s] > 0. 

(ii) G(juS) + G t (— jui) > 0 Vo; € IZ. 

(iii) (a) lim io 2 [G(jio) + G T (— jiv)\ > 0, D + D T > 0 if det[D + D T ] = 0. 
(b) lim [G(ju) + G T (—ju))\ > 0 if det[D + D T ] / 0. 

Necessary and sufficient conditions on the matrices A. B, C, and D in 
(3.5.3) for G(s) to be PR, SPR are given by the following lemmas which are 
generalizations of the lemmas in the SISO case to the MIMO case. 

Theorem 3.5.4 [215] Assume that G(s) given by (3.5.3) is such that G(s) + 
G T (—s) has rank n almost everywhere in the complex plane, det(s/ — A) has 
all its zeros in the open left half plane and (A, B) is completely controllable. 
Then G(s) is SPR if and only if for any real symmetric positive definite 
matrix L, there exist a real symmetric positive definite matrix P, a scalar 
v > 0, real matrices Q and K such that 

A t P + PA = -QQ t - nL 

PB = C ± QK 
K t K = D t + D 

Lemma 3.5.5 [177] Assume that the transfer matrix G{s) has poles that 
lie in Refs] < —7 where 7 > 0 and ( A , B , C, D ) is a minimal realization of 
G(s). Then, G{s) is SPR if and only if a matrix P = P T > 0, and matrices 
Q , K exist such that 

A t P + PA = -QQ t - 27 P 

PB = C± QK 
K t K = D + D t 
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Figure 3.2 General feedback system. 


The SPR properties of transfer function matrices are used in the analysis 
of a class of adaptive schemes designed for multi-input multi-output (MIMO) 
plants in a manner similar to that of SISO plants. 


3.6 Stability of LTI Feedback Systems 

3.6.1 A General LTI Feedback System 

The block diagram of a typical feedback system is shown in Figure 3.2. Here 
Gq(s) represents the transfer function of the plant model and C(s), F(s) are 
the cascade and feedback compensators, respectively. The control input u 
generated from feedback is corrupted by an input disturbance d u to form the 
plant input uq. Similarly, the output of the plant yo is corrupted by an output 
disturbance d to form the actual plant output y. The measured output y n , 
corrupted by the measurement noise d n , is the input to the compensator 
F(s) whose output y e is subtracted from the reference (set point) y* to form 
the error signal e. 

The transfer functions Go(.s), C(s), F(s) are generally proper and causal, 
and are allowed to be rational or irrational, which means they may include 
time delays. 

The feedback system can be described in a compact I/O matrix form by 
treating R = [y*, d u , d, d n ] T as the input vector, and E = [e,uo,y,y n ] T and 
Y = [yo,y e ,u] T as the output vectors, i.e., 


E = H(s)R, Y = RE + I 2 R 


(3.6.1) 
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where 
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3.6.2 Internal Stability 

Equation (3.6.1) relates all the signals in the system with the external inputs 
y * , d u , d, d n . From a practical viewpoint it is important to guarantee that 
for any bounded input vector R, all the signals at any point in the feedback 
system are bounded. This motivates the definition of the so-called internal 
stability [152, 231]. 

Definition 3.6.1 The feedback system is internally stable if for any bounded 
external input R, the signal vectors Y, E are bounded and in addition 


mioo < Cill-RHoo, Halloo < C 2 1 1 -R 1 1 oo 

for some constants ci,c 2 > 0 that are independent of R. 


(3.6.2) 


A necessary and sufficient condition for the feedback system (3.6.1) to be 
internally stable is that each element of H(s) has stable poles, i.e. , poles in 
the open left half .s-plane [231]. 

The concept of internal stability may be confusing to some readers be- 
cause of the fact that in most undergraduate books the stability of the feed- 
back system is checked by examining the roots of the characteristic equation 

1 + FCG 0 = 0 

The following example is used to illustrate such confusions. 

Example 3.6.1 Consider 

Go(s ) = COO = ^§, *00 = 1 
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for which the characteristic equation 


1 + FCG 0 = 1 + 


(a ~2) 

(s + 5)(s — 2) 


1 + 


1 

7+5 


= 0 


(3.6.3) 


has a single root at s = —6, indicating stability. On the other hand, the transfer 
matrix H(s) calculated from this example yields 


e 


"s + 5 

_ (s + 5)/ (s _2) 

-(s + 5) 

-(s + 5) 

u 0 

1 

s — 2 

s + 5 

-(s-2) 

-(s-2) 

y 

s + 6 

1 

(s + 5)/(s-2) 

s + 5 

1 

Vn _ 


1 

( s + 5)/(s — 2) 

s + 5 

s + 5 


(3.6.4) 


indicating that a bounded d u will produce unbounded e,y,y n , i.e., the feedback 
system is not internally stable. We should note that in calculating (3.6.3) and 
(3.6.4), we assume that fp)|g(s) and g(s) are the same transfer functions for any 
constant a. In (3.6.3) the exact cancellation of the pole at s = 2 of G 0 (s) by 
the zero at s = 2 of C(s) led to the wrong stability result, whereas in (3.6.4) 
such cancellations have no effect on internal stability. This example indicates that 
internal stability is more complete than the usual stability derived from the roots 
of 1 + FCGq = 0. If, however, Gq,C,F are expressed as the ratio of coprime 
polynomials, i.e., G 0 (s) = ,G(s) = ^j,F(s) = then a necessary and 

sufficient condition for internal stability [231] is that the roots of the characteristic 
equation 

PoPcPf + n 0 n c n.f = 0 (3.6.5) 

are in the open left-half s-plane. For the example under consideration, no = l,po = 
s — 2, n c = s — 2, p c = s + 5, n/ = l,pf = 1 we have 


(s - 2)(s + 5) + (s - 2) = (s - 2 )(s + 6) = 0 


which has one unstable root indicating that the feedback is not internally stable. 

V 


3.6.3 Sensitivity and Complementary Sensitivity Functions 

Although internal stability guarantees that all signals in the feedback system 
are bounded for any bounded external inputs, performance requirements put 
restrictions on the size of some of the signal bounds. For example, one of the 
main objectives of a feedback controller is to keep the error between the plant 
output y and the reference signal y* small in the presence of external inputs, 
such as reference inputs, bounded disturbances, and noise. Let us consider 
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the case where in the feedback system of Figure 3.2, F(s) = 1, d u = 0. Using 
(3.6.1), we can derive the following relationships between the plant output 
y and external inputs y*, d, d n : 


y = T 0 y* + S 0 d - T 0 d n (3.6.6) 

where 

a A 1 rp A CGp 
0 “ 1 + CGo ’ 0 “ 1 + CGo 

are referred to as the sensitivity function and complementary sensitivity 
function, respectively. It follows that So, To satisfy 


So + T 0 = 1 (3.6.7) 

It is clear from (3.6.6) that for good tracking and output disturbance rejec- 
tion, the loop gain Lo = CGo has to be chosen large so that So ~ 0 and 
To ~ 1. On the other hand, the suppression of the effect of the measure- 
ment noise d n on y requires Lo to be small so that To ~ 0, which from 
(3.6.7) implies that So ~ 1. This illustrates one of the basic trade-offs in 
feedback design, which is good reference tracking and disturbance rejection 
(| To | 3> l,So ~ 0,To ~ 1) has to be traded off against suppression of 
measurement noise (|To| <C 1, So ~ 1, To « 0). In a wide class of control 
problems, y* , d are usually low frequency signals, and d n is dominant only at 
high frequencies. In this case, C(s) can be designed so that at low frequen- 
cies the loop gain Lo is large, i.e., So ~ 0, To « 1, and at high frequencies To 
is small, i.e., So ~ 1,T 0 « 0. Another reason for requiring the loop gain To 
to be small at high frequencies is the presence of dynamic plant uncertainties 
whose effect is discussed in later chapters. 

3.6.4 Internal Model Principle 

In many control problems, the reference input or setpoint y* can be modeled 
as 

Q r {s)y* = 0 (3.6.8) 

where Q r {s) is a known polynomial, and s = ^ is the differential operator. 
For example, when y* =constant, Q r (s) = s. When y* = t, Q r {s) = s 2 and 
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when y* = Asinuot for some constants A and cuo, then Q r (s ) = s 2 + ljq, etc. 
Similarly, a deterministic disturbance d can be modeled as 


Qd(s)d = 0 


(3.6.9) 


for some known Qd(s), in cases where sufficient information about d is avail- 
able. For example, if d is a sinusoidal signal with unknown amplitude and 
phase but with known frequency ujd then it can be modeled by (3.6.9) with 
Qd(s) = S 2 +tuj. 

The idea behind the internal model principle is that by including the 
factor q ( s )Q d {s) th e compensator C(s), we can null the effect of y*,d on 
the tracking error e = y* — y. To see how this works, consider the feedback 
system in Figure 3.2 with F(s) = 1 , d u = d n = 0 and with the reference 
input y* and disturbance d satisfying (3.6.8) and (3.6.9) respectively for some 
known polynomials Q r {s), Qd(s). Let us now replace C(s) in Figure 3.2 with 

C W = 7Tr\’ QOO = Qr(s)Qd(s) (3.6.10) 

where C(s) in (3.6.10) is now chosen so that the poles of each element of 
H(s) in (3.6.1) with C(s) replaced by C(s) are stable. From (3.6.6) with 
d u = d n = 0 and C replaced by C/Q, we have 

e = y*-y = y+~tgk v * ~ TV^ d = q + cg 0 q{v * - d) 


Because Q = Q r Qd nulls d,y*, i.e., Q(s)d = 0 ,Q(s)y* = 0, it follows that 


e = 


1 

Q + CGo 


[ 0 ] 


which together with the stability of 1 /{Q + CGq ) guaranteed by the choice 
of C(s) imply that e(t ) = y*(t) — y(t) tends to zero exponentially fast. 

The property of exact tracking guaranteed by the internal model principle 
can also be derived from the values of the sensitivity and complementary 
sensitivity functions So, To at the frequencies of y*,d. For example, if y* = 
sincuoi and d = sinuqt, i.e., Q(s) = (s 2 + Wq )(s 2 + u 2 ) we have 
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and So(jujo) = So CM) = 0, T 0 (jw 0 ) = T 0 (jui) = 1. 

A special case of the internal model principle is integral control where 
Q(s) = s which is widely used in industrial control to null the effect of 
constant set points and disturbances on the tracking error. 


3.7 Problems 

3.1 (a) Sketch the unit disk defined by the set {a; \x € 7Z 2 , |aj| p < 1} for (i) 

p = 1, (ii) p= 2, (iii) p = 3, and (iv) p = oo. 

(b) Calculate the T 2 norm of the vector function 

x(t) = [e -2t , e _t ] T 

by (i) using the | • 1 2 -norm in IT 2 and (ii) using the | • |oo norm in TZ 2 . 

3.2 Let y = G(s)u and g(t) be the impulse response of G(s). Consider the induced 

norm of the operator T defined as HTHoo = sup|| u || oo=1 ||^||°° , where T : u € 
Loo y (z Loo. Show that HTHoo = ||5||i- 

3.3 Take u(t) = f(t) where f(t), shown in the figure, is a sequence of pulses centered 

at n with width A, and amplitude n, where n = 1,2,..., oo. 



(a) Show that u £ L\ but u £2 and u Loo- 

(b) If y = G(s)u where G(s ) = and u = f, show that y £ L\ f) Loo and 

\y(t)\ — > 0 as t — > 00 . 


3.4 Consider the system depicted in the following figure: 
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Let H“2- dooe * * ^ooe satisfy 

II (i^iei)tll < 7i||eit|| +Pi 

||(^2e 2 )t|| < 72||e 2 t|| + [ e _a(t_r) 7(r)||e 2 r|Mr + ^ 2 

Jo 

for all t > 0, where 71 > 0,72 > 0, a > 0, /3 i ,/?2 are constants, 7 (t) is a 
nonnegative continuous function and ||(-) t ||, ||(-)| denote the £ ooe ,£ oa norm 
respectively. Let ||ui|| < c, || zt2 1| < c for some constant c > 0, and 7172 < 1 
(small gain). Show that 

(a) If 7 € £ 2 , then e \ , e 2 , y\ , y- 2 G £oo- 

(b) If 7 G S(n), then e\,e 2 ,yi,y 2 G £00 for any /i G [0,/i*), where fi* = 

q 2 (l-7i7 2 ) 2 

2C07? ‘ 

3.5 Consider the system described by the following equation: 

x = A(t)x + f(t, x) + u (3.7.1) 

where f(t,x ) satisfies 

\f(t,x)\ < j{t)\x\ + 70(f) 

for all x G 72.”, t > 0 and f(t, 0) = 0, where 7 (t) > 0, 70(f) are continuous 
functions. If the equilibrium y e = 0 of y = A(t)y is u.a.s and 7 G S(ji), show 
that the following statements hold for some fi* > 0 and any /i G [0,/£): 

(a) u G £00 and j 0 G <S(z/) for any ^ > 0 implies x G £oo 

(b) u = 0,7o = 0 implies that the equilibrium x e = 0 of (3.7.1) is u.a.s in 

the large 

(c) u, 70 G £2 implies that x G £00 and liirp_ 00 x{t) = 0 

(d) If u = 70 = 0 then the solution x(t; to, Xo ) of (3.7.1) satisfies 

\x(t;t 0 ,x 0 )\ < K e~ \ x(to) \ for f > f 0 > 0 

where f3 = a—coyK > 0 for some constant Co and I\ , a > 0 are constants 
in the bound for the state transition matrix 4>(f,r) of y = A{i)y , i.e. , 

||$(£t)|| < Ke~ a ^ t ~ T \ for t>r >t 0 
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3.6 Consider the LTI system 

x = {A + eB)x 

where e > 0 is a scalar. Calculate e* > 0 such that for all e € [0,e*), the 
equilibrium state x e = 0 is e.s. in the large when 


( a ) 


A = 


-1 10 

0 -2 ’ 


B = 


1 0 
2 5 


(b) 


A = 


0 10 
0 -1 ’ 



-8 

2 


using (i) the Euclidean norm and (ii) the infinity norm. 


3.7 Consider the system given by the following block diagram: 



where F(s) is designed such that the closed-loop system is internally stable 
when A (s) = 0, i.e., 1+ ( ^ G , 1+ p G are stable transfer functions. Derive condi- 
tions on G, F using the small gain theorem such that the mapping T : r i— > y 
is bounded in £ 2 for any A(s) that satisfies IjA^)^ < S, where S > 0 is a 
given constant. 

3.8 Consider the LTI system 

x = {A + B)x, x e TV 1 

where ReXi(A) < 0 and B is an arbitrary constant matrix. Find a bound on 
||B|| for x e = 0 to be e.s. in the large by 

(a) Using an appropriate Lyapunov function 

(b) Without the use of a Lyapunov function 

3.9 Examine the stability of the equilibrium states of the following differential 

equations: 

(a) x = sin t x 

(b) x = (3tsin< — t)x 
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(c) x = a{t)x, where a(t) is a continuous function with a(t) < 0 Vf > to > 0 


3.10 Use Lyapunov’s direct method to analyze the stability of the following sys- 
tems: 


( a ) 


X\ = ~X\ + X1X2 

&2 = -7®? 


(b) x + 2x 3 + 2x = 0 


(c) x = 


1 -3 

2 -5 


(d) 


x\ = — 2 x\ + X1X2 
X 2 = —x\ — 0X2 


3.11 Find the equilibrium state of the scalar differential equation 

x = —{x — T)(x — 2) 2 


and examine their stability properties using 

(a) Linearization 

(b) Appropriate Lyapunov functions 

3.12 Consider the system described by 


X\ = x 2 

X2 = —X2 — (fc 0 + sinful 

where ko > 0 is a constant. Use an appropriate Lyapunov function to inves- 
tigate stability of the equilibrium states. 

3.13 Check the PR and SPR properties of the systems given by the following 
transfer functions: 

(a) Gr( S ) = 

(b) G 2 ( S ) = ^ 

( C ) ^ 3 ( S ) = (s+3)(s+5) 

(d) G 4 (s) = 

3.14 Assume that the transfer function Gi(s) and G 2 (s) are PR. Check whether 
the following transfer functions are PR in general: 

(a) G 0 (s) = Gi(s) + G 2 (s) 

(b) G m (s) = G 1 (s)G 2 (s) 

( c ) ^/(s) = Gi(s)[l + G 2 (s)]^ 1 
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Repeat (a), (b), and (c) when Gi(s) and G 2 (s ) are SPR. 
3.15 (a) Let 


Gi(s) 


1 

s + r 


L(s) 


s — 1 
(s + 1) 2 


Find a bound 3 on e > 0 for the transfer function 


to be PR, SPR. 

(b) Repeat (a) when 

Gi(a) 


G e (s ) — Gi(s) + eL(s) 


s + 5 

(s + 2) (s + 3) 


L(s) = 


Comment on your results. 

3.16 Consider the following feedback system: 


1 

s + 1 



where a(s) = s 2 — 3s + 2 and b(s) = s + a. Choose the constants do, 61,62 
such that for a = 5, the transfer function 


y(s) 

u(s) 


W m (s) 


1 

s + 1 


What if a = — 5 ? Explain. 

3.17 Consider the following system: 

e = Ae + t, e\ = C T e 

<fi = — ei sin t. 


where (j>,e 1 G 1Z 1 , e G TV 1 , ( A , B ) is controllable and W m (s) = C T (sI—A)~ 1 B 
is SPR. Use Lemma 3.5.3 to find an appropriate Lyapunov function to study 
the stability properties of the system (i.e., of the equilibrium state (\> e = 
0, e e = 0). 


3 Find an e* > 0 such that for all e G [0, e*), G e (s) is PR, SPR. 






Chapter 4 


On-Line Parameter 
Estimation 

4.1 Introduction 

In Chapter 2, we discussed various types of model representations and pa- 
rameterizations that describe the behavior of a wide class of dynamic sys- 
tems. Given the structure of the model, the model response is determined 
by the values of certain constants referred to as plant or model parameters. 
In some applications these parameters may be measured or calculated using 
the laws of physics, properties of materials, etc. In many other applications, 
this is not possible, and the parameters have to be deduced by observing 
the system’s response to certain inputs. If the parameters are fixed for all 
time, their determination is easier, especially when the system is linear and 
stable. In such a case, simple frequency or time domain techniques may 
be used to deduce the unknown parameters by processing the measured re- 
sponse data off-line. For this reason, these techniques are often referred to 
as off-line parameter estimation techniques. In many applications, the struc- 
ture of the model of the plant may be known, but its parameters may be 
unknown and changing with time because of changes in operating conditions, 
aging of equipment, etc., rendering off-line parameter estimation techniques 
ineffective. The appropriate estimation schemes to use in this case are the 
ones that provide frequent estimates of the parameters of the plant model 
by properly processing the plant I/O data on-line. We refer to these scheme 
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as on-line estimation schemes. 

The purpose of this chapter is to present the design and analysis of a 
wide class of schemes that can be used for on-line parameter estimation. The 
essential idea behind on-line estimation is the comparison of the observed 
system response y(t), with the output of a parameterized model y(9, t ) whose 
structure is the same as that of the plant model. The parameter vector 9(t) 
is adjusted continuously so that y(9. t ) approaches y(t) as t increases. Under 
certain input conditions, y(9, t) being close to y(t) implies that 9(t) is close to 
the unknown parameter vector 6* of the plant model. The on-line estimation 
procedure, therefore, involves three steps: In the first step, an appropriate 
parameterization of the plant model is selected. This is an important step 
because some plant models are more convenient than others. The second 
step 

generating or updating 9(t). The adaptive law is usually a differential 
equation whose state is 9(t) and is designed using stability considerations or 
simple optimization techniques to minimize the difference between y{t) and 
y(9, t ) with respect to 9{t ) at each time t. The third and final step is the 
design of the plant input so that the properties of the adaptive law imply 
that 9{t ) approaches the unknown plant parameter vector 6* as t — > oo. 
This step is more important in problems where the estimation of 8* is one of 
the objectives and is treated separately in Chapter 5. In adaptive control, 
where the convergence of 9{t) to 9* is usually not one of the objectives, 
the first two steps are the most important ones. In Chapters 6 and 7, a 
wide class of adaptive controllers are designed by combining the on-line 
estimation schemes of this chapter with appropriate control laws. Unlike 
the off-line estimation schemes, the on-line ones are designed to be used 
with either stable or unstable plants. This is important in adaptive control 
where stabilization of the unknown plant is one of the immediate objectives. 

The chapter is organized as follows: We begin with simple examples 
presented in Section 4.2, which we use to illustrate the basic ideas behind 
the design and analysis of adaptive laws for on-line parameter estimation. 
These examples involve plants that are stable and whose states are avail- 
able for measurement. In Section 4.3, we extend the results of Section 4.2 
to plants that may be unstable, and only part of the plant states is avail- 
able for measurement. We develop a wide class of adaptive laws using a 
Lyapunov design approach and simple optimization techniques based on the 
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gradient method and least squares. The on-line estimation problem where 
the unknown parameters are constrained to lie inside known convex sets is 
discussed in Section 4.4. The estimation of parameters that appear in a spe- 
cial bilinear form is a problem that often appears in model reference adaptive 
control. It is treated in Section 4.5. In the adaptive laws of Sections 4.2 to 
4.5, the parameter estimates are generated continuously with time. In Sec- 
tion 4.6, we discuss a class of adaptive laws, referred to as hybrid adaptive 
laws, where the parameter estimates are generated at finite intervals of time. 
A summary of the adaptive laws developed is given in Section 4.7 where the 
main equations of the adaptive laws and their properties are organized in ta- 
bles. Section 4.8 contains most of the involved proofs dealing with parameter 
convergence. 

4.2 Simple Examples 

In this section, we use simple examples to illustrate the derivation and prop- 
erties of simple on-line parameter estimation schemes. The simplicity of 
these examples allows the reader to understand the design methodologies 
and stability issues in parameter estimation without having to deal with the 
more complex differential equations that arise in the general case. 

4.2.1 Scalar Example: One Unknown Parameter 

Let us consider the following plant described by the algebraic equation 

y (t) = 9*u{t) (4.2.1) 

where u £ C 0 0 is the scalar input, y(t) is the output, and 9* is an unknown 
scalar. Assuming that u(t),y(t) are measured, it is desired to obtain an 
estimate of 9* at each time t. If the measurements of y, u were noise free, 
one could simply calculate 9{t), the estimate of 9* , as 

m = || (4.2.2) 

whenever u{t ) ^ 0. The division in (4.2.2), however, may not be desirable 
because u{t) may assume values arbitrarily close to zero. Furthermore, the 
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effect of noise on the measurement of u, y may lead to an erroneous estimate 
of 9*. The noise and computational error effects in (4.2.2) may be reduced 
by using various other nonrecursive or off-line methods especially when 9* 
is a constant for all t. 

In our case, we are interested in a recursive or on-line method to generate 
9(t). We are looking for a differential equation, which depends on signals 
that are measured, whose solution is 9{t ) and its equilibrium state is 9 e = 9*. 
The procedure for developing such a differential equation is given below. 

Using 9(t) as the estimate of 9* at time t. we generate the estimated or 
predicted value y(t) of the output y(t) as 

y(t) = 9(t)u{t) (4.2.3) 

The prediction or estimation error ei, which reflects the parameter uncer- 
tainty because 9(t) is different from 9*, is formed as the difference between 
y and y, i.e., 

U = y- y = y-9u (4.2.4) 

The dependence of e\ on the parameter estimation error 9 = 9 — 9* becomes 
obvious if we use (4.2.1) to substitute for y in (4.2.4), i.e., 

ei = 9*u — 9u = —9u (4.2.5) 

The differential equation for generating 9(t ) is now developed by minimizing 
various cost criteria of ei with respect to 9 using the gradient or Newton’s 
method. Such criteria are discussed in great detail in Section 4.3. For this 
example, we concentrate on the simple cost criterion 

which we minimize with respect to 9. For each time t, the function J(9) is 
convex over 77. 1 ; therefore, any local minimum of J is also global and satisfies 
V J(9) = 0. One can solve V J{9) = — (y — 9u)u = 0 for 9 and obtain the 
nonrecursive scheme (4.2.2) or use the gradient method (see Appendix B) 
to form the recursive scheme 


9 = — yVJ(0) = 7 (y — 9u)u = 7 ej. 1 t, 0(0) = 9q 


(4.2.6) 



148 


CHAPTER 4. ON-LINE PARAMETER ESTIMATION 



Figure 4.1 Implementation of the scalar adaptive law (4.2.6). 


where 7 > 0 is a scaling constant, which we refer to as the adaptive gain. 
In the literature, the differential equation (4.2.6) is referred to as the update 
law or the adaptive law or the estimator , to name a few. In this book, we 
refer to (4.2.6) as the adaptive law for updating 9(t) or estimating 9* on-line. 

The implementation of the adaptive law (4.2.6) is shown in Figure 4.1. 
The stability properties of (4.2.6) are analyzed by rewriting (4.2.6) in terms 
of the parameter error 9 = 9 — 9*, i.e. , 

9 = 9-9* = 7 e i« - 9* 

Because e\ = 9*u — 9u = —9u and 9* is constant, i.e., 9* = 0, we have 

9 = - 7 u 2 9 , 0( 0) = 0(0) - 9* (4.2.7) 


We should emphasize that (4.2.7) is used only for analysis. It cannot be used 
to generate 0(f) because given an initial estimate 0(0) of 9* , the initial value 
0(0) = 0(0) — 9* , which is required for implementing (4.2.7) is unknown due 
to the unknown 9*. 

Let us analyze (4.2.7) by choosing the Lyapunov function 


V(9) 


02 

2 7 


The time derivative V of V along the solution of (4.2.7) is given by 


V = 


0 T 0 


7 
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which after substitution of 9 from (4.2.7) becomes 

V = -u 2 0 2 = -el<0 (4.2.8) 

which implies that the equilibrium 9 e = 0 of (4.2.7) is u.s. 

Because no further information about u(t) is assumed other than u G Coo, 
we cannot guarantee that V < 0 (e.g., take u(t) = 0) and, therefore, cannot 
establish that 9 e = 0 is a.s. or e.s. We can, however, use the properties of 
V, V to establish convergence for the estimation error and other signals in 

(4.2.6) . For example, because V > 0 is a nonincreasing function of time, the 
limt-xxj V(8(t)) = Voo exists. Therefore, from (4.2.8) we have 

POO POO 

/ e\(r)dT = — V(r)dT = V 0 -Voo 
Jo Jo 

where Vq = V(0(O)), which implies that e\ G £ 2 - Now from (4.2.6) and 
u G Coo, we also have that 9 G £00 f) £2 • Because, as we have shown in 
Chapter 3, a square integrable function may not have a limit, let alone tend 
to zero with time, we cannot establish that e\ (t), 9(t) — ► 0 as t — > 00 without 
additional conditions. If, however, we assume that u G Coo, then it follows 
that e\ = — 9u — Oil G Coo', therefore, from Lemma 3.2.5 we have e\(t) — ► 0 
as t — > 00 , which is implied by e\ G C 2 ,e\ G Coo ■ This, in turn, leads to 

lim 9(t) = lim 9(t) = 0 (4.2.9) 

t—> OO t—> OO 

The conclusion of this analysis is that for any u, u G Coo, the adaptive law 

(4.2.6) guarantees that the estimated output y(t) converges to the actual 
output y(t) and the speed of adaptation (i.e. , the rate of change of the 
parameters 9) decreases with time and converges to zero asymptotically. 

One important question to ask at this stage is whether 9{t ) converges as 
t — > 00 and, if it does, is the lim^oo 9(t) = 9*7 

A quick look at (4.2.9) may lead some readers to conclude that 9{t ) — > 0 
as t — > 00 implies that 9{t) does converge to a constant. This conclusion is 
obviously false because the function 9(t) = sin(ln(2 + t)) satisfies (4.2.9) but 
has no limit. We have established, however, that V(9(t)) = converges to 
Voo us t. — * 00 , i.e., lirn^oo 9 2 {t) = 2^/ Voo, which implies that 6(t) and, there- 
fore, 9(t) does converge to a constant, i.e., lirn^oo 9(t ) = ± v^^T^oo + 9* = 9. 
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Hence the question which still remains unanswered is whether 6 = 6*. It is 
clear from (4.2.1) that for u(t) = 0, a valid member of the class of input sig- 
nals considered, y(t) = 0 Vf > 0, which provides absolutely no information 
about the unknown 6* . It is, therefore, obvious that without additional con- 
ditions on the input u(t),y(t ) may not contain sufficient information about 
6* for the identification of 6* to be possible. 

For this simple example, we can derive explicit conditions on u(t) that 
guarantee parameter convergence by considering the closed-form solution of 
(4.2.7), i.e., 

6( t ) = e -^fo u2 ^ dT 6(0) (4.2.10) 

For inputs u(t) = 0 or u(t) = e _t , 9(t) does not tend to zero, whereas for 
u 2 (t) = -j^, 6(t) tends to zero asymptotically but not exponentially. A 
necessary and sufficient condition for Oft) to converge to zero exponentially 
fast is that u(t) satisfies 

rt+T 0 

J u 2 {t)<1t > a^To (4.2.11) 

Vi > 0 and for some oto,To > 0 (see Problem 4.1). It is clear that u(t) = 1 
satisfies (4.2.11), whereas u(t) = 0 or e - * or ^ does not. The property of u 
given by (4.2.11) is referred to as persistent excitation (PE) and is crucial in 
many adaptive schemes where parameter convergence is one of the objectives. 
The signal u, which satisfies (4.2.11), is referred to be persistently exciting 
(PE). The PE property of signals is discussed in more detail in Section 4.3. 

It is clear from (4.2.10), (4.2.11) that the rate of exponential convergence 
of 6(t) to zero is proportional to the adaptive gain 7 and the constant 07 
in (4.2.11), referred to as the level of excitation. Increasing the value of 7 
will speed up the convergence of 6{t) to 6*. A large 7, however, may make 
the differential equation (4.2.6) “stiff” and, therefore, more difficult to solve 
numerically. 

The same methodology as above may be used for the identification of an 
n-dimensional vector 6* that satisfies the algebraic equation 

y = d* T (f (4.2.12) 

where y £ 7 Z l ,(f> £ 7 Z n are bounded signals available for measurement. We 
will deal with this general case in subsequent sections. 
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4.2.2 First-Order Example: Two Unknowns 

Consider the following first-order plant 

x = —ax + bu, x'(0) = xq (4.2.13) 

where the parameters a and b are constant but unknown, and the input 
u and state x are available for measurement. We assume that a > 0 and 
u G Coo so that x E Coo- The objective is to generate an adaptive law for 
estimating a and b on-line by using the observed signals u(t) and x(t). 

As in Section 4.2.1, the adaptive law for generating the estimates a and 
b of a and b, respectively, is to be driven by the estimation error 

ei = x — x (4.2.14) 

where x is the estimated value of x formed by using the estimates a and b. 
The state x is usually generated by an equation that has the same form as 
the plant but with a and b replaced by a and b, respectively. For example, 
considering the plant equation (4.2.13), we can generate x from 

x = —ax + bu, x(0) = xq (P) 

Equation (P) is known as the parallel model configuration [123] and the 
estimation method based on (P) as the output error method [123, 172]. The 
plant equation, however, may be rewritten in various different forms giving 
rise to different equations for generating x. For example, we can add and 
subtract the term a m x, where a m > 0 is an arbitrary design constant, in 
(4.2.13) and rewrite the plant equation as 

x = —a m x + (a m — a)x + bu 



where 6 * = \b, a m — d T , d> = * u, „ , * x , which is in the form of the 

L J \_S-\-CLm J 

algebraic equation considered in Section 4.2.1. Therefore, instead of using 
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(P), we may also generate x from 

x = —a m x + ( a m — a)x + bu 


that is 

X = [(am - a)x + bu] (SP) 

by considering the parameterization of the plant given by (4.2.15). Equa- 
tion (SP) is widely used for parameter estimation and is known as the series- 
parallel model [123]. The estimation method based on (SP) is called the equa- 
tion error method [123, 172]. Various other models that are a combination of 
(P) and (SP) are generated [123] by considering different paranreterizations 
for the plant (4.2.13). 

The estimation error e\ = x — x satisfies the differential equation 


for model (P) and 
for model (SP) where 


ei = — aei + ax — bu 
ei = — a m ei + ax — bu 
a = a — a, b = b — b 


(PI) 

(SP1) 


are the parameter errors. Equations (PI) and (SP1) indicate how the param- 
eter error affects the estimation error e\. Because a, a m > 0, zero parameter 
error, i.e. , a = b = 0, implies that e\ converges to zero exponentially. Be- 
cause a, b are unknown, ei is the only measured signal that we can monitor 
in practice to check the success of estimation. We should emphasize, how- 
ever, that ei — > 0 does not imply that a, b — > 0 unless some PE properties 
are satisfied by x,x,u as we will demonstrate later on in this section. We 
should also note that e\ cannot be generated from (PI) and (SP1) because 
a and b are unknown. Equations (PI) and (SP1) are, therefore, only used 
for the purpose of analysis. 

Let us now use the error equation (SP1) to derive the adaptive laws for 
estimating a and b. We assume that the adaptive laws are of the form 


a = fi(ei,x,x,u), b = f 2 (ei,x,x,u) 


(4.2.16) 


where f\ and f -2 are functions of measured signals, and are to be chosen so 
that the equilibrium state 

e = a, b e = b, ei e = 0 


a, 


(4.2.17) 
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of the third-order differential equation described by (SP1) (where x G C^ 
is treated as an independent function of time) and (4.2.16) is u.s., or, if 
possible, u.a.s., or, even better, e.s. 

We choose /i, f -2 so that a certain function V (ei, a, b ) and its time deriva- 
tive V along the solution of (SP1), (4.2.16) are such that V qualifies as a Lya- 
punov function that satisfies some of the conditions given by Theorems 3.4.1 
to 3.4.4 in Chapter 3. We start by considering the quadratic function 

V(e\ ,a,b) = -(ef + a~ + b 2 ) (4.2.18) 

which is positive definite, decrescent, and radially unbounded in 1Z? . The 
time derivative of V along any trajectory of (SP1), (4.2.16) is given by 

V = —a m ef + axe i — hue \ + afi + 6/2 (4.2.19) 

and is evaluated by using the identities a = a, b = b, which hold because a 
and b are assumed to be constant. 

If we choose f\ = — e\x, fi = e\ u, we have 

V = -a m e{ < 0 (4.2.20) 

and (4.2.16) becomes 

a = —eix, b = e\u (4.2.21) 

where e\ = x — x and x is generated by (SP). 

Applying Theorem 3.4.1 to (4.2.18) and (4.2.20), we conclude that V 
is a Lyapunov function for the system (SP1), (4.2.16) where x and u are 
treated as independent bounded functions of time and the equilibrium given 
by (4.2.17) is u.s. Furthermore, the trajectory ei (t) , a(t) , b(t) is bounded for 
all t > 0. Because e\ = x — x and x G we also have that x G Coo, 
therefore, all signals in (SP1) and (4.2.21) are uniformly bounded. As in the 
example given in Section 4.2.1, (4.2.18) and (4.2.20) imply that 

lim V(ei(t),a(t),b(t)) = < 00 

t — >00 

and, therefore, 

r 00 1 poo . 1 

/ e\[r)dT = / Vdr = — (F 0 - Foo) 

JO JO 
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where Vo = V(ei(0), a(0), 6(0)), i.e., e\ G £ 2 . Because u,a,b,x,e 1 G Cqc, it 
follows from (SP1) that <7 G T-00 , which, together with e\ G £2, implies that 
ei(t) 0 as t — > 00 , which, in turn, implies that a(t), b(t ) 0 as t — > 00 . 

It is worth noting that e\ (t), a(t), b(t ) — > 0 as t — * 00 do not imply that 
a and b converge to any constant let alone to zero. As in the example of 
Section 4.2.1, we can use (4.2.18) and (4.2.20) and establish that 

lim (a 2 (f) + b 2 {t )) = 21 ^ 

t — >00 

which again does not imply that a and b have a limit, e.g., take 
a(t) = \Z2Voo sin v /1 + t , b{t) = cos Vl + 1 

The failure to establish parameter convergence may motivate the reader 
to question the choice of the Lyapunov function given by (4.2.18) and of the 
functions fi, f -2 in (4.2.19). The reader may argue that perhaps for some 
other choices of V and /i,/ 2 , u.a.s could be established for the equilibrium 
(4.2.17) that will automatically imply that a, b — ► 0 as t — > 00 . Since given 
a differential equation, there is no procedure for finding the appropriate 
Lyapunov function to establish stability in general, this argument appears 
to be quite valid. We can counteract this argument, however, by applying 
simple intuition to the plant equation (4.2.13). In our analysis, we put no 
restriction on the input signal u, apart from u G £oo, and no assumption is 
made about the initial state xo- For u = 0, an allowable input in our analysis, 
and xo = 0 , no information can be extracted about the unknown parameters 
a, b from the measurements of x(t) = 0, u(t) = 0, Vi > 0. Therefore, no 
matter how intelligent an adaptive law is, parameter error convergence to 
zero cannot be achieved when u = 0 Vt > 0. This simplistic explanation 
demonstrates that additional conditions have to be imposed on the input 
signal u to establish parameter error convergence to zero. Therefore, no 
matter what V and /i ,/2 we choose, we can not establish u.a.s. without 
imposing conditions on the input u. These conditions are similar to those 
imposed on the input u in Section 4.2.1, and will be discussed and analyzed 
in Chapter 5. 

In the adaptive law (4.2.21), the adaptive gains are set equal to 1. A 
similar adaptive law with arbitrary adaptive gains 71 , 72 > 0 is derived by 
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considering 


1 


b 2 ' 


V(e 1 ,~a,b) = ^[el + -+"- 

2 \ 7i 72 


instead of (4.2.18). Following the same procedure as before we obtain 


a = — 7ieix, b = 7 2 £i u 


where 71,72 > 0 are chosen appropriately to slow down or speed up adapta- 
tion. 

Using (4.2.18) with model (PI) and following the same analysis as with 
model (SP1), we obtain 


a = — e\x, b = e\u 


(4.2.22) 


and 

V = — ae\ < 0 

Hence, the same conclusions as with (4.2.21) are drawn for (4.2.22). 

We should note that V for (PI) depends on the unknown a, whereas for 
(SP1) it depends on the known design scalar a m . Another crucial difference 
between model (P) and (SP) is their performance in the presence of noise, 
which becomes clear after rewriting the adaptive law for a in (4.2.21), (4.2.22) 
as 

a = — (x — x)x = x 2 — xx (P) 

a = —(x — x)x = — x 2 + xx (SP) 

If the measured plant state x is corrupted by some noise signal v, i.e. , x is 
replaced by x + v in the adaptive law, it is clear that for the model (SP), 
a will depend on v 2 and v, whereas for model (P) only on v. The effect of 
noise ( v 2 ) may result in biased estimates in the case of model (SP), whereas 
the quality of estimation will be less affected in the case of model (P). The 
difference between the two models led some researchers to the development 
of more complicated models that combine the good noise properties of the 
parallel model (P) with the design flexibility of the series-parallel model (SP) 
[47, 123]. 
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Simulations 

We simulate the parallel and series-parallel estimators and examine the ef- 
fects of the input signal u. the adaptive gain and noise disturbance on their 
performance. For simplicity, we consider a first-order example y = -J^u 
with two unknown parameters a and b. Two adaptive estimators 

a = —e\x, b = e\u 

x = —ax + bu , ei = x — x 

and 

e±x, b = e\u 

a m x + (a m — a)x + bu, e\ = x — x 

based on the parallel and series-parallel model, respectively, are simulated 
with a = 2 and 6=1. The results are given in Figures 4.2 and Figure 4.3, 
respectively. Plots (a) and (b) in Figure 4.2 and 4.3 give the time response 
of the estimated parameters when the input u = sin5f, and the adaptive 
gain 7 = 1 for (a) and 7 = 5 for (b). Plots (c) in both figures give the results 
of estimation for a step input, where persistent excitation and, therefore, 
parameter convergence are not guaranteed. Plots (d) show the performance 
of the estimator when the measurement x(t) is corrupted by d(t) = 0 . 1 n(f), 
where n(t) is a normally distributed white noise. 

It is clear from Figures 4.2 (a,b) and Figure 4.3 (a,b) that the use of a 
larger value of the adaptive gain 7 led to a faster convergence of a and b 
to their true values. The lack of parameter convergence to the true values 
in Figure 4.2 (c), 4.3 (c) is due to the use of a non-PE input signal. As 
expected, the parameter estimates are more biased in the case of the series- 
parallel estimator shown in Figure 4.3 (d) than those of the parallel one 
shown in Figure 4.2 (d). 

4.2.3 Vector Case 

Let us extend the example of Section 4.2.2 to the higher-order case where 
the plant is described by the vector differential equation 

x = A p x + B p u 



(4.2.23) 



a 


a 


b 


b 


a 


a 


b 


b 


Figure 4.2 Simulation results of the parallel estimator, (a) u = sin5f, 7=1, no 
measurement noise; (b) u = sin 5 1 , 7 = 5, no measurement noise; (c) u =unit 
step function, 7 = 1, no measurement noise; (d) u = sin5f, 7 = 1 , output x is 
corrupted by d(t) = 0.1?i(t), where n(t) is a normally distributed white noise. 


where the state x 6 TZ n and input u € TZ r are available for measurement, 
A p € lZ nxn ,B p £ TZ nxr are unknown, A p is stable, and u E Coo- As in the 
scalar case, we form the parallel model 

x = A p x + B p u, x £ lZ n (P) 

where A p (t), B p (t ) are the estimates of A p , B p at time t to be generated by 
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Figure 4.3 Simulation results of the series-parallel estimator, (a) u = sin5f, 
7 = 1, no measurement noise; (b) u = sin5f, 7 = 5, no measurement noise; 
(c) u =unit step function, 7 = 1, no measurement noise; (d) u = sin 5t, 7 = 1, 
output x is corrupted by d(t) = 0 . 1 n(t), where n(t) is the normally distributed 
white noise. 

an adaptive law, and x(t) is the estimate of the vector x(t). Similarly, by 
considering the plant parameterization 

x — d - -^- 77 i)x -j- B p u 

where A m is an arbitrary stable matrix, we define the series-parallel model 


as 


x — Ay)iX -}- i^Ap Arff^x B p u 


(SP) 
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The estimation error vector e\ defined as 

A 

€\ — X — X 


satisfies 


for model (P) and 


e\ = A p e i — A p x — B p u 


e± = A m e i - A p x - B p u 


(PI) 

(SP1) 


for model (SP), where A p = A p — A p . B p = B p — B p . 

Let us consider the parallel model design and use (PI) to derive the 
adaptive law for estimating the elements of A p , B p . We assume that the 
adaptive law has the general structure 


A p = Fi(ei,x,x,u), B p = F 2 (e 1 ,x,x,u) (4.2.24) 


where Fi and F 2 are functions of known signals that are to be chosen so that 
the equilibrium 

— Ap . B pe — B p , 6ie — 0 

of (PI), (4.2.24) has some desired stability properties. We start by consid- 
ering the function 


V(ei,A p ,B p ) 


eJPe i + tr 


AjPA p \ 

7i ) 


+ tr 


BjPBp\ 

72 j 


(4.2.25) 


where tr(^4) denotes the trace of a matrix A, 71, 72 > 0 are constant scalars, 
and P = P T > 0 is chosen as the solution of the Lyapunov equation 


PA„ + A'P= -I 


(4.2.26) 


whose existence is guaranteed by the stability of A p (see Theorem 3.4.10). 
The time derivative V of V along the trajectory of (PI), (4.2.24) is given by 


+tr i Ai^ + i£^| +tr 


71 


71 


B p B B p B p P B p 


72 


72 
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which after substituting for e\ ,A p ,B p becomes 

i- x ~r ~ t ~ ( AlPFi bJpf 2 \ 

V = eJ(PAp + XIP)e 1 -2eJPApX-2eJPB p u + tr 2 -?■ + 2 -? 

V 7i 72 / 

(4.2.27) 

We use the following properties of trace to manipulate (4.2.27): 

(i) tr(AB) = tr (BA) 

(ii) tr(A + B) = tr(yl) + tr(L>) for any A,B& 7 Z nxn 

(iii) tr (yx T ) = x T y for any x, y G IZ nxl 

We have 

ej PA p x = x T Ap Pe\ = ti(Ap Pe\x T ) 
ej PB p u = ti(Bj Peiu T ) 

and, therefore, 


V = — ej €\ + 2tr 


(AlPF l 


V 


7i 


- AlPeix T + 


BpPF 2 

72 


- Bj Pern 1 


(4.2.28) 


The obvious choice for F \ , F 2 to make V negative is 

A p = Fi = 7ieix T , B p = F -2 = 7261 u T (4.2.29) 

In the case of the series-parallel model, we choose 

- - t ( AlPA v \ (BjPB v \ 

V(e 1 ,A p ,Bp)=eJPe 1 + tr( p p \ + tr ( p P \ (4.2.30) 

where P = P T > 0 is the solution of the Lyapunov equation 

A T m P + PA m = -I (4.2.31) 

By following the same procedure as in the case of the parallel model, we 
obtain 

A p = 7ieix T , B p = 7261 u T (4.2.32) 
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Remark 4.2.1 If instead of (4.2.30), we choose 

V ( e i) A p , B p ) = ej Pei + tr + tr (^Bj B p ^j (4.2.33) 

where P = P T > 0 satisfies (4.2.31), we obtain 

A p = Pe i.t t , B p = Peiu T (4.2.34) 

In this case P is the adaptive gain matrix and is calculated using 
(4.2.31). Because A m is a known stable matrix, the calculation of P 
is possible. It should be noted that if we use the same procedure for 
the parallel model, we will end up with an adaptive law that depends 
on P that satisfies the Lyapunov equation (4.2.26) for A p . Because 
Ap is unknown, P cannot be calculated; therefore, the adaptive laws 
corresponding to (4.2.34) for the parallel model are not implement able. 

The time derivative V of V in both the parallel and series-parallel esti- 
mators satisfies 

V = —ej e± < 0 

which implies that the equilibrium A pe = A p , B pe = B p ,e i e = 0 of the respec- 
tive equations is u.s. Using arguments similar to those used in Section 4.2.2 
we establish that ei £ £ 2 , ei £ Loo and that 

|ei(f)|— >0, ||l p (i)|| 0, j|R p (i)|| — ► 0 ast-^oo 

The convergence properties of A p , B p to their true values A p ,B p , respectively, 
depend on the properties of the input u. As we will discuss in Chapter 5, if u 
belongs to the class of sufficiently rich inputs, i.e. , u has enough frequencies 
to excite all the modes of the plant, then the vector [x T ,u T ] T is PE and 
guarantees that A p ,B p converge to A p ,B p , respectively, exponentially fast. 

4.2.4 Remarks 

(i) In this section we consider the design of on-line parameter estimators 
for simple plants that are stable, whose states are accessible for mea- 
surement and whose input u is bounded. Because no feedback is used 
and the plant is not disturbed by any signal other than u, the stability 
of the plant is not an issue. The main concern, therefore, is the stability 
properties of the estimator or adaptive law that generates the on-line 
estimates for the unknown plant parameters. 



162 


CHAPTER 4. ON-LINE PARAMETER ESTIMATION 


(ii) For the examples considered, we are able to design on-line parameter es- 
timation schemes that guarantee that the estimation error e\ converges 
to zero as t — > oo, i.e. , the predicted state x approaches that of the plant 
as t — > oo and the estimated parameters change more and more slowly 
as time increases. This result, however, is not sufficient to establish 
parameter convergence to the true parameter values unless the input 
signal u is sufficiently rich. To be sufficiently rich, u has to have enough 
frequencies to excite all the modes of the plant. 

(iii) The properties of the adaptive schemes developed in this section rely 
on the stability of the plant and the boundedness of the plant input u. 
Consequently, they may not be appropriate for use in connection with 
control problems where u is the result of feedback and is, therefore, no 
longer guaranteed to be bounded a priori . In the following sections, we 
develop on-line parameter estimation schemes that do not rely on the 
stability of the plant and the boundedness of the plant input. 


4.3 Adaptive Laws with Normalization 

In Section 4.2, we used several simple examples to illustrate the design of 
various adaptive laws under the assumption that the full state of the plant 
is available for measurement, the plant is stable, and the plant input is 
bounded. In this section, we develop adaptive laws that do not require the 
plant to be stable or the plant input to be bounded a priori. Such adaptive 
laws are essential in adaptive control, to be considered in later chapters, 
where the stability of the plant and the boundedness of the plant input are 
properties to be proven and, therefore, cannot be assumed to hold a priori. 

We begin with simple examples of plants whose inputs and states are not 
restricted to be bounded and develop adaptive laws using various approaches. 
These results are then generalized to a wider class of higher-order plants 
whose output rather than the full state vector is available for measurement. 


4.3.1 Scalar Example 

Let us consider the simple plant given by the algebraic equation 


y (t) = 6*u(t ) 


(4.3.1) 
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where u and, therefore, y are piecewise continuous signals but not necessarily 
bounded, and 9* is to be estimated by using the measurements of y and u. 
As in Section 4.2.1, we can generate the estimate y of y and the estimation 
error e± as 

y = 9u 

ei = y-y = y-9u 

where 9{t ) is the estimate of 9* at time t. 

Because u and y are not guaranteed to be bounded, the minimization 
problem 

■ ■ ( y~ 0 u ) 2 

e v ' e 2 

is ill posed and, therefore, the procedure of Section 4.2.1 where iij £ Too 
does not extend to the case where u,y 0 Coo. This obstacle is avoided by 
dividing each side of (4.3.1) with some function referred to as the normalizing 
signal m > 0 to obtain 

y = 9*u (4.3.2) 

where 

V - u 

V = ~ , u = — 

m m 

are the normalized values of y and u, respectively, and m 2 = 1 + n 2 . The 
signal n s is chosen so that — £ Coo. A straightforward choice for n s is 
n s = u, i.e. , m 2 = 1 + u 2 . 

Because u, y £ Coo , w e can follow the procedure of Section 4.2.1 and 
develop an adaptive law based on (4.3.2) rather than on (4.3.1) as follows: 
The estimated value y of y is generated as 

y = 9u 

and the estimation error as 


d = y - y = y - 9u 

It is clear that e\ = — = v ~ u and y = y/m. The adaptive law for 9 is now 
developed by solving the well-posed minimization problem 


min J ( 9) 

e 


. (y - Ouf 

= nun 


e 2 


= min 

e 


(y ~ Qu ) 2 

2 m 2 
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Using the gradient method we obtain 

0 = 7Ci u , 7 > 0 


or in terms of the unnornralized signals 


0 = 7 


€\U 


m- 

where m may be chosen as m 2 = 1 + u 2 . 

For clarity of presentation, we rewrite (4.3.3) and (4.3.4) as 


0 = 7 cu 


(4.3.3) 


(4.3.4) 


(4.3.5) 


where 


ei 


2 

m z 

We refer to e as the normalized estimation error. This notation enables us 
to unify results based on different approaches and is adopted throughout the 
book. 

Let us now analyze (4.3.5) by rewriting it in terms of the parameter error 
6 = 6 — 6*. Using 6 = 6 and e = ei/m 2 = — we have 



(4.3.6) 


We propose the Lyapunov function 

02 
27 


V(6) 


whose time derivative along the solution of (4.3.6) is given by 


V = —9 2 u 2 = —e 2 m 2 < 0 


Hence, 0, 0 6 Coo and era £ £2- Because 0, u £ £«,, it follows from e = — 0^ 
that e,em £ Coo. If we now rewrite (4.3.5) as 0 = r yemu, it follows from 
em £ Coo f) £2 and u £ Coo that 0 £ £00 D £2- Since jjjem = — 6 u — 6 u 

and 0,0, u £ Coo , it follows that for u £ £oo we have |(em) £ £oo, which, 
together with em £ £2, implies that em — ► 0 as t — ► 00. This, in turn, 
implies that 0 — ► 0 as t — > 00. 
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The significance of this example is that even in the case of unbounded y. u 
we are able to develop an adaptive law that guarantees bounded parameter 
estimates and a speed of adaptation that is bounded in an C 2 and Too sense 
(be., 0 E Too fl T 2 ). 

When n s = 0, i.e. , m = 1 the adaptive law (4.3.5) becomes the unnor- 
malized one considered in Section 4.2. It is obvious that for m = 1, in 
(4.3.5), i.e., e = ei, we can still establish parameter boundedness, but we 
can not guarantee boundedness for 6 in an C p sense unless u E C 0 c . As 
we will demonstrate in Chapter 6, the property that 9 E £2 is crucial for 
stability when adaptive laws of the form (4.3.5) are used with control laws 
based on the certainty equivalence principle to stabilize unknown plants. 



(4.3.11) 
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as in Section 4.2.2. The time derivative of V along (4.3.9) and (4.3.10) is 
given by 

V = —a m e\ < 0 (4.3.12) 

Because x is not necessarily bounded, it cannot be treated as an independent 
bounded function of time in (4.3.10) and, therefore, (4.3.10) cannot be de- 
coupled from (4.3.8). Consequently, (4.3.8) to (4.3.10) have to be considered 
and analyzed together in IZ 3 , the space of e\ ,a,x. The chosen function V 
in (4.3.11) is only positive semidefinite in IZ 3 , which implies that V is not a 
Lyapunov function; therefore, Theorems 3.4.1 to 3.4.4 cannot be applied. V 
is, therefore, a Lyapunov-like function, and the properties of V, V allow us 
to draw some conclusions about the behavior of the solution e\ (t), a(t) with- 
out having to apply the Lyapunov Theorems 3.4.1 to 3.4.4. From V > 0 and 
V = —a m e 2 < 0 we conclude that V £ £oo, which implies that ei,a £ Coo, 
and ei £ £ 2 - Without assuming x £ however, we cannot establish any 
bound for a in an C p sense. 

As in Section 4.3.1, let us attempt to use normalization and modify 
(4.3.9) to achieve bounded speed of adaptation in some sense. The use of 
normalization is not straightforward in this case because of the dynamics 
introduced by the transfer function — ^ — , i.e., dividing each side of (4.3.7) 
by m may not help because 


(®m 

For this case, we propose the error signal 

1 2 

e = x — x en s = 

S + Clm. 


m s T a r 


a) 1- 

m 


u 

m 


(ax — en 2 s ) 


(4.3.13) 


i.e., 

e = —a m e + ax — en 2 

where n s is a normalizing signal to be designed. 

Let us now use the error equation (4.3.13) to develop an adaptive law for 
a. We consider the Lyapunov-like function 



(4.3.14) 
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whose time derivative along the solution of (4.3.13) is given by 

V = —a m e 2 — e 2 n 2 + aex + aa 


Choosing 


a = a = —ex 


(4.3.15) 


we have 

V = —a m e 2 — e 2 n 2 < 0 

which together with (4.3.14) imply V, e,a G Cx and e,en s € £ 2 . If we now 
write (4.3.15) as 

. j i 

a = —em — 
m 

where m 2 = 1 + n 2 and choose n s so that ^ G /loo, then em. G £2 (because 
e,en s G £ 2 ) implies that a G £ 2 - A straightforward choice for n s is n s = x, 
i.e. , m 2 = 1 + x 2 . 

The effect of n s can be roughly seen by rewriting (4.3.13) as 

e = —a m e — en 2 + ax (4.3.16) 


and solving for the “quasi” steady-state response 




ax 

a m + n 2 


(4.3.17) 


obtained by setting e ~ 0 in (4.3.16) and solving for e. Obviously, for 
n 2 = x 2 , large e s implies large a independent of the boundedness of x, which, 
in turn, implies that large e s carries information about the parameter error 
a even when x 0 £00 • This indicates that n s may be used to normalize the 
effect of the possible unbounded signal x and is, therefore, referred to as 
the normalizing signal. Because of the similarity of e s with the normalized 
estimation error defined in (4.3.5), we refer to e in (4.3.13), (4.3.16) as the 
normalized estimation error too. 


Remark 4.3.1 The normalizing term en 2 in (4.3.16) is similar to the non- 
linear “damping” term used in the control of nonlinear systems [99]. 
It makes V more negative by introducing the negative term —e 2 n 2 in 
the expression for V and helps establish that en s G £ 2 - Because a is 



CHAPTF, P d DN-TJNF, PAP AMFTF.P FSTTMATTDN 


1fi« 
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Figure 4.4 Effect of normalization on the convergence and performance of 
the adaptive law (4.3.15). 


bounded from above by e\/nf + 1 = cm and em G £ 2 , we can con- 
clude that a G £ 2 , which is a desired property of the adaptive law. 
Note, however, that a G £2 does not imply that a G Coo ■ In contrast 
to the example in Section 4.3.1, we have not been able to establish 
that a G £00 • As we will show in Chapter 6 and 7, the £2 property 
of the derivative of the estimated parameters is sufficient to establish 
stability in the adaptive control case. 


Simulations 

Let us simulate the effect of normalization on the convergence and perfor- 
mance of the adaptive law (4.3.15) when a = 0 is unknown, u = sinf, and 
a m = 2. We use n 2 s = ax 2 and consider different values of a > 0. The 
simulation results are shown in Figure 4.4. It is clear that large values of a 
lead to a large normalizing signal that slows down the speed of convergence. 
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4.3.3 General Plant 

Let us now consider the SISO plant 

x = Ax + Bu, x(0) = x 0 o 1fi x 

y = C^x (43 ‘ 18) 

where x € IZ n and only y, u are available for measurement. Equation (4.3.18) 
may also be written as 

y = C T (sI - A)~ 1 Bu + C T (si - A)~ l x 0 


or as 

Z(s) C T adj(sI — A)xq 
^ R(s) R(s) 


where Z(s),R(s) are in the form 


(4.3.19) 


Z(s ) — b n -is n 1 + b n — 2 S n 2 + • • • + b\s + bo 

R(s) = s n + a n —is n i + • • • + a-is + cio 

The constants di,bi for i = 0,1,..., n — 1 are the plant parameters. A 
convenient parameterization of the plant that allows us to extend the results 
of the previous sections to this general case is the one where the unknown 
parameters are separated from signals and expressed in the form of a linear 
equation. Several such parameterizations have already been explored and 
presented in Chapter 2. We summarize them here and refer to Chapter 2 
for the details of their derivation. 

Let 

— [bn— 1 , b n — 2 , •••,&]., 6o, 1) 0>n— 2> ■ • • j ®lj d oj 

be the vector with the unknown plant parameters. The vector 9* is of di- 
mension 2 n. If some of the coefficients of Z(s) are zero and known, i.e. , 
Z(s) is of degree m < n — 1 where m is known, the dimension of 9* may be 
reduced. Following the results of Chapter 2, the plant (4.3.19) may take any 
one of the following parameterizations: 


z = 9 * T 4 > + Vo 
y = o*\4> + vo 


(4.3.20) 

(4.3.21) 
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y = W(s)ef^ + Vo 


(4.3.22) 


where 


z = W 1 (s)y, <t> = H(s ) 


u 

y 


= Hi(s) 


u 

y 


n o = Cq e Act B 0 x 0 


9{ = 9*- b\ 

W\(s), H(s), Hi(s) are some known proper transfer function matrices with 
stable poles, b\ = [0, A T ] T is a known vector, and A c is a stable matrix which 
makes to be an exponentially decaying to zero term that is due to non- 
zero initial conditions. The transfer function W(s) is a known strictly proper 
transfer function with relative degree 1, stable poles, and stable zeros. 

Instead of dealing with each parametric model separately, we consider 
the general model 

z = W(s)0* t i/j + 7/o (4.3.23) 

where W(s) is a proper transfer function with stable poles, z € £ R 2n 

are signal vectors available for measurement and y o = Cq e Act Boxo. Initially 
we will assume that go = 0, i.e., 


z = W{s)6* T ip 


(4.3.24) 


and use (4.3.24) to develop adaptive laws for estimating 6* on-line. The effect 
of ?7o and, therefore, of the initial conditions will be treated in Section 4.3.7. 

Because 6* is a constant vector, going from form (4.3.23) to form (4.3.20) 
is trivial, i.e., rewrite (4.3.23) as z = 9* T W + r/o and define <fi = W(s)^. 
As illustrated in Chapter 2, the parametric model (4.3.24) may also be a 
parameterization of plants other than the LTI one given by (4.3.18). What 
is crucial about (4.3.24) is that the unknown vector 0* appears linearly in 
an equation where all other signals and parameters are known exactly. For 
this reason we will refer to (4.3.24) as the linear parametric model. In the 
literature, (3.4.24) has also been referred to as the linear regression model. 

In the following section we use different techniques to develop adaptive 
laws for estimating 9* on-line by assuming that W(s) is a known, proper 
transfer function with stable poles, and z, ^ are available for measurement. 
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4.3.4 SPR-Lyapunov Design Approach 

This approach dominated the literature of continuous adaptive schemes [48, 
149, 150, 153, 172, 178, 187]. It involves the development of a differential 
equation that relates the estimation or normalized estimation error with 
the parameter error through an SPR transfer function. Once in this form 
the KYP or the MKY Lemma is used to choose an appropriate Lyapunov 
function V whose time derivative V is made nonpositive, i.e. , V < 0 by 
properly choosing the differential equation of the adaptive law. 

The development of such an error SPR equation had been a challeng- 
ing problem in the early days of adaptive control [48, 150, 153, 178]. The 
efforts in those days were concentrated on finding the appropriate transfor- 
mation or generating the appropriate signals that allow the expression of the 
estimation/parameter error equation in the desired form. 

In this section we use the SPR-Lyapunov design approach to design adap- 
tive laws for estimating 9* in the parametric model (4.3.24). The connection 
of the parametric model (4.3.24) with the adaptive control problem is dis- 
cussed in later chapters. By treating parameter estimation independently 
of the control design, we manage to separate the complexity of the estima- 
tion part from that of the control part. We believe this approach simplifies 
the design and analysis of adaptive control schemes, to be discussed in later 
chapters, and helps clarify some of the earlier approaches that appear tricky 
and complicated to the nonspecialist. 

Let us start with the linear parametric model 

z = W(s)e* T ip (4.3.25) 

Because 9* is a constant vector, we can rewrite (4.3.25) in the form 

z = W(8)L(s)e* T <f> (4.3.26) 

where 

4 >= L~ l {s)^ 

and L(s) is chosen so that L~ 1 (s) is a proper stable transfer function and 
W(s)L(s) is a proper SPR transfer function. 

Remark 4.3.2 For some W(s) it is possible that no L(s) exists such that 
W(s)L(s) is proper and SPR. In such cases, (4.3.25) could be prop- 
erly manipulated and put in the form of (4.3.26). For example, when 
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W(s) = no L(s) can be found to make W(s)L(s) SPR. In this 
case, we write (4.3.25) as z = ( S+ 2 ) I (^-|- 3 ) ^* T ^ ) where (f> = fipjV’ and 
z = ir^z. The new W(s) in this case is W(s) = ^ + 2 ^ +3 ) and a wide 
class of L(s) can be found so that WL is SPR. 

The significance of the SPR property of W(s)L(s) is explained as we 
proceed with the design of the adaptive law. 

Let 9(t) be the estimate of 9* at time t. Then the estimate z of z at time 
t is constructed as 

z = W(s)L(s)9 T (j) (4.3.27) 

As with the examples in the previous section, the estimation error ei is 
generated as 

ei = z — z 


and the normalized estimation error as 

,2 


e = z — z — W (s)L(s)en 2 = e\ — W(s)L(s)en 2 
where n s is the normalizing signal which we design to satisfy 


Y r 2 i,2 

— 6 -LoO) m — 1 + Tig 

m 


(4.3.28) 


(Al) 


Typical choices for n s that satisfy (Al) are n 2 s = 0 T </>, n 2 = 4> T P(fi for 
any P = P T > 0, etc. When <fr G Coo, (Al) is satisfied with m = 1, i.e., 
n s = 0 in which case e = ei. 

We examine the properties of e by expressing (4.3.28) in terms of the 
parameter error 9 = 9 — 8*, i.e., substituting for z, z in (4.3.28) we obtain 

e = WL(—9 T (j) - en 2 ) (4.3.29) 

For simplicity, let us assume that L(s ) is chosen so that WL is strictly proper 
and consider the following state space representation of (4.3.29): 


e = A c e + B c (—9 T (j) — en 2 ) 
e = Cje 


(4.3.30) 


where A c , B c , and C c are the matrices associated with a state space repre- 
sentation that has a transfer function W(s)L(s) = Cj (si — A C )~ 1 B C . 
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The error equation (4.3.30) relates e with the parameter error 0 and is 
used to construct an appropriate Lyapunov type function for designing the 
adaptive law of 0. Before we proceed with such a design, let us examine 
(4.3.30) more closely by introducing the following remark. 


Remark 4.3.3 The normalized estimation error e and the parameters A c , 
B c , and C c in (4.3.30) can be calculated from (4.3.28) and the knowl- 
edge of WL, respectively. However, the state error e cannot be mea- 
sured or generated because of the unknown input 0 T 4>. 


Let us now consider the following Lyapunov-like function for the differ- 
ential equation (4.3.30): 


V0,e) 


e T P c e 0 t T~ 1 0 

— H 

2 2 


(4.3.31) 


where T = T t > 0 is a constant matrix and P c = Pj > 0 satisfies the 
algebraic equations 


P C A C + AjP c = -qq T - vL c 
P C B C = C c 


(4.3.32) 


for some vector q, matrix L c = Lj > 0 and a small constant v > 0. Equation 
(4.3.32) is guaranteed by the SPR property of W(s)L(s) = Cj (si — A c )~ l B c 
and the KYL Lemma if (A c , B c ,C c ) is minimal or the MKY Lemma if 
(A c , B c , C c ) is nonminimal. 


Remark 4.3.4 Because the signal vector cj) in (4.3.30) is arbitrary and could 
easily be the state of another differential equation, the function (4.3.31) 
is not guaranteed to be positive definite in a space that includes (f>. 
Hence, V is a Lyapunov-like function. 

The time derivative V along the solution of (4.3.30) is given by 
V(0, e) = ~ e T qq T e - ^e T L c e + e T P c B c [-0 J (j) - en 2 s ] + 0 T W l 0 (4.3.33) 

We now need to choose 0 = 0 as a function of signals that can be measured 
so that the indefinite terms in V are canceled out. Because e is not available 
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for measurement, 9 cannot depend on e explicitly. Therefore, at first glance, 
it seems that the indefinite term — e T P C B C 9 T <f> = —9 T (f>e T P C B C cannot be 
cancelled because the choice 9 = 6 = T^>e T P C B C is not acceptable due to the 
presence of the unknown signal e. 

Here, however, is where the SPR property of WL becomes handy. We 
know from (4.3.32) that P C B C = C c which implies that e T P C B C = e T C c = e. 
Therefore, (4.3.33) can be written as 

V ( 9 , e) = — -e T qq T e — —e T L c e — e9 T cj) — e 2 n 2 (4.3.34) 

The choice for 9 = 9 to make V < 0 is now obvious, i.e. , for 

9 = 9 = Tecj) (4.3.35) 

we have 

V(9,e) = — -e T qg T e — -e T L c e — e 2 n 2 < 0 (4.3.36) 

which together with (4.3.31) implies that V,e,e,9,9 £ and that 

lim V(9(t),e(t)) = Tx, < 

t^-OO 

Furthermore, it follows from (4.3.36) that 

r e 2 n 2 dr + £ f°° e T L c edr < V{9{0), e(0)) - Hoc (4.3.37) 

Jo 2 Jo 

Because A mm (L c )|e| 2 < e T L c e and V(9( 0), e(0)) is finite for any finite initial 
condition, (4.3.37) implies that en s ,e £ £2 and therefore e = Cje £ £ 2 - 
From the adaptive law (4.3.35), we have 

\9\ < ||r|| leml — 
m 

where m 2 = 1 + n 2 . Since e 2 m 2 = e 2 + e 2 n 2 and e, en s £ £2 we have that 
em £ £ 2 , which together with ^ £ £ oo implies that 9 = 9 £ £ 2 . 

We summarize the properties of (4.3.35) by the following theorem. 


Theorem 4.3.1 The adaptive law (4-3.35) guarantees that 
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Figure 4.5 Block diagram for implementing adaptive algorithm (4.3.35) 
with normalized estimation error. 

(i) 9, e G £oo 

(ii) e,en s ,9<EC 2 

independent of the boundedness properties of 4>. 


Remark 4.3.5 Conditions (i) and (ii) of Theorem 4.3.1 specify the quality 
of estimation guaranteed by the adaptive law (4.3.35). In Chapters 6 
and 7, we will combine (4.3.35) with appropriate control laws to form 
adaptive control schemes. The stability properties of these schemes 
depend on the properties (i) and (ii) of the adaptive law. 

Remark 4.3.6 The adaptive law (4.3.35) using the normalized estimation 
error generated by (4.3.28) can be implemented using the block di- 
agram shown in Figure 4.5. When W(s)L(s) = 1, the normalized 
estimation error becomes e = ei/(l + n 2 ) with e\ = z — z, which is the 
same normalization used in the gradient algorithm. This result can be 
obtained using simple block diagram transformation, as illustrated in 
Figure 4.6. 

Remark 4.3.7 The normalizing effect of the signal n s can be explained 
by setting e = 0 in (4.3.30) and solving for the “quasi” steady-state 
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e z — z 

1 

e 


1 + n 2 s 



Figure 4.6 Two equivalent block diagrams for generating the normalized 
estimation error when W(s)L(s) = 1. 


response e ss of e, i.e., 

_ a(—9 T (f>) _ e lss 
ss 1 + an 2 1 + cm 2 


(4.3.38) 


where a = —CjA~ 1 B c is positive, i.e., a > 0, because of the SPR 
property of WL and ei ss is the “quasi” steady state response of e\. 
Because of n s , e ss cannot become unbounded as a result of a possibly 
unbounded signal cj). Large e ss implies that 9 is large; therefore, large 
e carries information about 9, which is less affected by <j). 


Remark 4.3.8 The normalizing signal n s may be chosen as n 2 s = (j) T cj) or 
as n 2 s = (j) T P(t)<j) where P(t ) = P T {t) > 0 has continuous bounded 
elements. In general, if we set n s = 0 we cannot establish that 0 £ £2 
which, as we show in later chapters, is a crucial property for estab- 
lishing stability in the adaptive control case. In some special cases, we 
can afford to set n s = 0 and still establish that 9 € £ 2 - For example, 
if 4> G Loo or if 9, e 6 Loo implies that ^ G Loo, then it follows from 
(4.3.35) that e G £2 =4- 9 G £ 2 - 

When n 2 s = 0, i.e., m = 1, we refer to (4.3.35) as the unnormalized 
adaptive law. In this case e = ei leading to the type of adaptive 
laws considered in Section 4.2. In later chapters, we show how to use 
both the normalized and unnornralized adaptive laws in the design of 
adaptive control schemes. 


Another desired property of the adaptive law (4.3.35) is the convergence 
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of 9(t) to the unknown vector 6*. Such a property is achieved for a special 
class of vector signals (p described by the following definition: 

Definition 4.3.1 (Persistence of Excitation (PE)) A piecewise contin- 
uous signal vector cp : IZ + i— > IZ n is PE in IZ n with a level of excitation 
cco > 0 if there exist constants «i,To > 0 such that 

1 rt+To 

a\ I > — / (p{j)(p ( r)dr > ao I, Vt > 0 (4.3.39) 

To Jt 

Although the matrix <p(r)(p T (r) is singular for each r, (4.3.39) requires that 
4>{t) varies in such a way with time that the integral of the matrix <^>(t)(/> t (t) 
is uniformly positive definite over any time interval [t, t + To]. 

If we express (4.3.39) in the scalar form, i.e., 

ai > — / {q T < P ( t )) 2 dr > o 0 , Vf > 0 (4.3.40) 

Jo Jt 

where q is any constant vector in IZ n with |(/| = 1, then the condition can 
be interpreted as a condition on the energy of <f> in all directions. The 
properties of PE signals as well as various other equivalent definitions and 
interpretations are given in the literature [1, 12, 22, 24, 52, 75, 127, 141, 171, 
172, 201, 242], 

Corollary 4.3.1 If n s ,cp, cp 6 and 4> is PE, then (4-3.35) guarantees 
that 9(t) — > 9* exponentially fast. 

The proof of Corollary 4.3.1 is long and is given in Section 4.8. 

Corollary 4.3.1 is important in the case where parameter convergence is 
one of the primary objectives of the adaptive system. We use Corollary 4.3.1 
in Chapter 5 to establish parameter convergence in parameter identifiers and 
adaptive observers for stable plants. 

The condition that f> appears only in the case of the adaptive laws based 
on the SPR-Lyapunov approach with W(s)L(s) strictly proper. It is a con- 
dition in Lemma 4.8.3 (iii) that is used in the proof of Corollary 4.3.1 in 
Section 4.8. 

The results of Theorem 4.3.1 are also valid when W(s)L(s) is biproper 
(see Problem 4.4). In fact if W(s) is minimum phase, one may choose T(s) = 
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W x (s) leading to W(s)L(s) = 1. For WL = 1, (4.3.28), (4.3.29) become 

z — z 9 T (f> 


e = 


m. m- 

where m 2 = 1 + n 2 s . In this case we do not need to employ the KYP or MKY 
Lemma because the Lyapunov-like function 


V(0) = 


leads to 


V = —em 




2_2 


by choosing 

9 = Tec/) (4.3.41) 

The same adaptive law as (4.3.41) can be developed by using the gradient 
method to minimize a certain cost function of e with respect to 9. We discuss 
this method in the next section. 


Example 4.3.1 Let us consider the following signal 

y = Tsin(o;t + ip) 


that is broadcasted with a known frequency w but an unknown phase ip and un- 
known amplitude A. The signal y is observed through a device with transfer function 
W(s) that is designed to attenuate any possible noise present in the measurements 
of y, i.e., 

z = W(s)y = W(s)Asin(ujt + ip) (4.3.42) 

For simplicity let us assume that W(s) is an SPR transfer function. Our objective 
is to use the knowledge of the frequency w and the measurements of z to estimate 
A, ip. Because A, p may assume different constant values at different times we would 
like to use the results of this section and generate an on-line estimation scheme that 
provides continuous estimates for A, ip. The first step in our approach is to transform 
(4.3.42) in the form of the linear parametric model (4.3.24). This is done by using 
the identity 

A sin(wt + <p) = A 1 sin tot + A 2 cos uit 


where 

A 1 = Acosip, A-2 = Asvn.<p (4.3.43) 

to express (4.3.42) in the form 


z = W{s)d* T (j> 
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where 0* = [Hi,H 2 ] t and <f> = [sinwi,cosw<] T . 

From the estimate of 6 *, i.e., A\,A 2 , we can calculate the estimate of A,tp by 
using the relationship (4.3.43). Using the results of this section the estimate 9(t) of 
9* at each time t is given by 

9 = Te<t> 

e = z — z — W{s)en 2 s , z = W{s)6 T <j>, n% = a<f> T <j> 

where 9 = [Ai,H 2 ] t and Ai,A 2 is the estimate of Ax,A 2 , respectively. Since 
4> £ Coo, the normalizing signal may be taken to be equal to zero, i.e., a = 0. The 
adaptive gain T may be chosen as T = diag( 7) for some 7 > 0, leading to 

A\ = yesinwf, A 2 = ye cos wt (4.3.44) 


The above adaptive law guarantees that A 1 ,A 2 ,e € C 00 and e,Ai,A 2 £ C 2 . Since 
4> £ Coo w e also have that A \ , A 2 £ Coo ■ As we mentioned earlier the convergence 
of Ai,A 2 to Ai,A 2 respectively is guaranteed provided <f> is PE. We check the PE 
property of (f) by using (4.3.39). We have 


1 

To 



(f){T)cj) T (r) dr 


^ rS(t,T 0 ) 
Jo 


where 


S(t,T 0 ) = 


T 0 sin 2 u>(t + T 0 ) — sin 2 u>t 

y " iw 

cos 2 uj(t + T 0 ) — cos 2 ut 
4u> 


cos 2 ui(t + T 0 ) — cos 2 cut 
4 u 

T 0 | sin 2 u)(t + T 0 ) — sin 2 cot 

y + y 


For Tq = — , we have 


1 /»t+T 0 

— / </>(r)</) T (r)dT = 

Jo Jt 


z 0 

u 

0 ^ , 

u) J 


Hence, the PE condition (4.3.39) is satisfied with T 0 = 0 < a 0 < ir/w, ot\ > 

— ■ therefore, <j) is PE, which implies that A\,A 2 converge to A\,A 2 exponentially 
fast. 

Using (4.3.43), the estimate A, dp of A, p>, respectively, is calculated as follows: 


m = ^Ai(t)+Ai(t), 



(4.3.45) 


The calculation of (p at each time t is possible provided A(t) 7^ 0. This implies 
that A 1 , A 2 should not go through zero at the same time, which is something that 
cannot be guaranteed by the adaptive law (4.3.44). We know, however, that Ai,A 2 
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converge to Hi, 4.2 exponentially fast, and Ai,A 2 cannot be both equal to zero 
(otherwise y = 0 Vt > 0). Hence, after some finite time T, A\ , A 2 will be close 
enough to A±,A 2 for A(T) ^ 0 to imply A(t) ^ 0 Vt > T. 

Because Ai,A 2 — > Ai,A% exponentially fast, it follows from (4.3.45) that A,<p 
converge to A, exponentially fast. 

Let us simulate the above estimation scheme when W(s) = to = 2 rad/sec 
and the unknown A, <p are taken as A = 10, <p = 16° = 0.279 rad for 0 < t < 20 sec 
and A = 7, <p = 25° = 0.463 rad for t > 20 sec for simulation purposes. The results 
are shown in Figure 4.7, where 7 = 1 is used. \7 


Example 4.3.2 Consider the following plant: 


V = 


b±s + b 0 

II 

s 2 + 3s + 2 


where b\ 1 bo are the only unknown parameters to be estimated. We rewrite the plant 
in the form of the parametric model (4.3.24) by first expressing it as 


y = 


1 

(s + 1) (s + 2) 


0* T ip 


(4.3.46) 


where 6*= [61, 5 0 ] T , ip= [u, w] T . We then choose L(s) = s + 2 so that W(s)L(s) = jxt 
is SPR and rewrite (4.3.46) as 

y = ( 4 - 3 - 47 ) 

s + 1 


where (f) = can g enera t e d by filtering u. Because (4.3.47) is in 

the form of parametric model (4.3.24), we can apply the results of this section to 
obtain the adaptive law 

0 = Te(f> 


e = y l —{0 T (t) + en 2 s ), n s = a(f> T cj) 

s + 1 

where a > 0 and 0 = [61, 60] T is the on-line estimate of 6*. This example illustrates 
that the dimensionality of 6, (j> may be reduced if some of the plant parameters are 
known. SJ 


4.3.5 Gradient Method 

Some of the earlier approaches to adaptive control in the early 1960s [20, 34, 
96, 104, 115, 123, 175, 220] involved the use of simple optimization techniques 



A 

A 


9 

9 


A ( t ) sin (2t + cp (f) ) 
A sin (2f + <p) 


Figure 4.7 Simulation results for Example 4.3.1. 


such as the gradient or steepest descent method to minimize a certain perfor- 
mance cost with respect to some adjustable parameters. These approaches 
led to the development of a wide class of adaptive algorithms that had found 
wide applications in industry. Despite their success in applications, the 
schemes of the 1960s lost their popularity because of the lack of stability in 
a global sense. As a result, starting from the late 1960s and early 1970s, 
the schemes of the 1960s have been replaced by new schemes that are based 
on Lyapunov theory. The gradient method, however, as a tool for designing 
adaptive laws retained its popularity and has been widely used in discrete- 
time [73] and, to a less extent, continuous-time adaptive systems. In contrast 
to the schemes of the 1960s, the schemes of the 1970s and 1980s that are 
based on gradient methods are shown to have global stability properties. 
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What made the difference with the newer schemes were new formulations of 
the parameter estimation problem and the selection of different cost func- 
tions for minimization. 

In this section, we use the gradient method and two different cost func- 
tions to develop adaptive laws for estimating 9* in the parametric model 

z = W(s)9* T 'ip (4.3.24) 

The use of the gradient method involves the development of an algebraic 
estimation error equation that motivates the selection of an appropriate cost 
function J{9) that is convex over the space of 9(t), the estimate of 9* at time 
t, for each time t. The function J(9 ) is then minimized with respect to 9 
for each time t by using the gradient method described in Appendix B. The 
algebraic error equation is developed as follows: 

Because 9* is constant, the parametric model (4.3.24) can be written in 
the form 

z = 9* T (t> (4.3.48) 

where 4> = W(s)i/j. 

The parametric model (4.3.48) has been the most popular one in discrete 
time adaptive control. At each time t, (4.3.48) is an algebraic equation where 
the unknown 9* appears linearly. Because of the simplicity of (4.3.48), a wide 
class of recursive adaptive laws may be developed. 

Using (4.3.48) the estimate z of z at time t is generated as 



where 9(t) is the estimate of 9* at time t. The normalized estimation error 
e is then constructed as 


z — z 

m 2 


z — 9 J (j> 
m 2 


(4.3.49) 


where m 2 = 1 + n 2 and n s is the normalizing signal designed so that 

— £ Coo (Al) 

m 

As in Section 4.3.4, typical choices for n s are n 2 = <(> T </>, n 2 = (j) T P<j> for 
P = P T > 0, etc. 



4.3. ADAPTIVE LAWS WITH NORMALIZATION 


183 


For analysis purposes we express e as a function of the parameter error 
8 = 8 — 8*, i.e. , substituting for z in (4.3.49) we obtain 

_ 8 T cj) 
m 2 

Clearly the signal em = -8 T -^ is a reasonable measure of the parameter 
error 8 because for any piecewise continuous signal vector cf) (not necessarily 
bounded), large em implies large 8. Several adaptive laws for 8 can be 
generated by using the gradient method to minimize a wide class of cost 
functions of e with respect to 8. In this section we concentrate on two 
different cost functions that attracted considerable interest in the adaptive 
control community. 


(4.3.50) 


Instantaneous Cost Function 


Let us consider the simple quadratic cost function 


J(8) 


e 2 m 2 
2 


(z — 8 T (j)) 2 

2 m 2 


(4.3.51) 


motivated from (4.3.49), (4.3.50), that we like to minimize with respect to 
8. Because of the property (Al) of rri, J(8) is convex over the space of 8 
at each time t; therefore, the minimization problem is well posed. Applying 
the gradient method, the minimizing trajectory 8(t) is generated by the 
differential equation 

8 = —TVJ(8) 

where F = r T > 0 is a scaling matrix that we refer to as the adaptive gain. 
From (4.3.51) we have 


VJ{8) = 


(z — 0 T </>)(/> 

m 2 


and, therefore, the adaptive law for generating 8(t) is given by 


8 = Tecj) 


(4.3.52) 


We refer to (4.3.52) as the gradient algorithm. 
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Remark 4.3.9 The adaptive law (4.3.52) has the same form as (4.3.35) 
developed using the Lyapunov design approach. As shown in Section 
4.3.4, (4.3.52) follows directly from the Lyapunov design method by 
taking L(s) = 4L _1 (s). 

Remark 4.3.10 The convexity of J(9) (as explained in Appendix B) guar- 
antees the existence of a single global minimum defined by VJ(0) = 0. 
Solving VJ(9) = —ecj> = — ^ <t> = 0, be., 4>z = (j)(j) T 6, for 9 will give 

us the nonrecursive gradient algorithm 

9{t) = (# T ) - Vz 

provided that (jxj) T is nonsingular. For <j> 6 IZ nxl and n > 1, c/xp 1 ' is 
always singular, the following nonrecursive algorithm based on N data 
points could be used: 

/ N \ — 1 N 

Q{t) = ( <t>{u)(t> T {L) j <t>{u)z(ti) 

\i=l / i = 1 

where ti < t, i = 1, . . . , N are the points in time where the measure- 
ments of cj) and z are taken. 

Remark 4.3.11 The minimum of J(9 ) corresponds to e = 0, which implies 
0 = 0 and the end of adaptation. The proof that 9{t) will converge 
to a trajectory that corresponds to e being small in some sense is 
not directly guaranteed by the gradient method. A Lyapunov type 
of analysis is used to establish such a result as shown in the proof of 
Theorem 4.3.2 that follows. 


Theorem 4.3.2 The adaptive law (4-3.52) guarantees that 

(i) e,en s ,9,9 e Coo 

(ii) e,en s ,9<EC 2 

independent of the boundedness of the signal vector 0 and 

(iii) ifn s ,(j) G Too a nd (j) is PE, then 9{t ) converges exponentially to 9* 



4.3. ADAPTIVE LAWS WITH NORMALIZATION 


185 


Proof Because 9* is constant, 9 = 9 and from (4.3.52) we have 


9 = Tec/) 


We choose the Lyapunov-like function 


V(9) 


e T r ~ x 8 

2 


Then along the solution of (4.3.53), we have 

V = 9 r (f>e = —e 2 m 2 < 0 


(4.3.53) 


(4.3.54) 


where the second equality is obtained by substituting 9 T (j> = —an 2 from (4.3.50). 
Hence, V, 9 € Coo, which, together with (4.3.50), implies that e, em € /loo- In 
addition, we establish from the properties of V, V, by applying the same argument 
as in the previous sections, that em £ £ 2 , which implies that e, en s € £ 2 - Now from 
(4.3.53) we have 

|0| = |0|<||r|||em|M (4.3.55) 

m 

which together with ^ £ £oo and em £ £2 f) £00 implies that 8 £ £ 2 f) £00 and 
the proof for (i) and (ii) is complete. 

The proof for (iii) is long and more complicated and is given in Section 4.8. □ 


Remark 4.3.12 The property V(d) >0 and V < 0 of the Lyapunov-like 
function implies that lim^oo V(6(t)) = Vx,. This, however, does not 
imply that V (t) goes to zero as t — > 00 . Consequently, we cannot con- 
clude that e or em go to zero as t — * 00 , i.e. , that the steepest descent 
reaches the global minimum that corresponds to VJ(0) = — eef> = 0. 
If however, m/m £ we can establish that |(em) G £oo, 

which, together with em G £ 2 , implies that e(t)m(t) 0 as t — ► 00 . 
Because m 2 = 1 + n 2 we have e(t) — > 0 as t — > 00 and from (4.3.55) 
that 0(t) — > 0 as t 00 . Now |VJ(6*)| < \e(f>\ < |em|^, which implies 
that |VJ(0(t))| —* 0 as t — > 00 , i.e., 0{t) converges to a trajectory that 
corresponds to a global minimum of J(9) asymptotically with time 
provided 

Remark 4.3.13 Even though the form of the gradient algorithm (4.3.52) 
is the same as that of the adaptive law (4.3.35) based on the SPR- 
Lyapunov design approach, their properties are different. For example, 
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(4.3.52) guarantees that 9 € Too, whereas such property has not been 
shown for (4.3.35). 

The speed of convergence of the estimated parameters to their true values, 
when n s ,(j) £ and <f> is PE, is characterized in the proof of Theorem 4.3.2 
(iii) in Section 4.8. It is shown that 

F^r-^t) < 7 n 0 T (o)r- 1 0 (o) 

where 0 < t < nTo, n is an integer and 

_ .. _ 2aoToA m j n (T) 

_71 ’ 7 l “ 2 m 0 + /3 4 T 0 2 A2 iax (r) 

where ao is the level of excitation of (j), Tq > 0 is the size of the time interval 
in the PE definition of </>, mo = sup t>0 m 2 (t) and (3 = sup i>0 |f/>(i)|. We 
established that 0 < 7 < 1. The smaller the 7 , i.e. , the larger the 71 , the 
faster the parameter error converges to zero. The constants ao,To,/3 and 
possibly mo are all interdependent because they all depend on 4>(t). It is, 
therefore, not very clear how to choose 4>(t), if we can, to increase the size 
of 71 . 


Integral Cost Function 

A cost function that attracted some interest in the literature of adaptive 
systems [108] is the integral cost function 

J{9) = \ / e~ / 3 ( *~ r ^e 2 (f,T)m 2 (T)ciT (4.3.56) 

2 Jo 

where /3 > 0 is a design constant and 


e(f,r) 


z{t) - 0 T (t)</>(r) 


m 2 (r) 


e(M) = e 


(4.3.57) 


is the normalized estimation error at time r based on the estimate 6(t ) of 9* 
at time t > r. The design constant /3 acts as a forgetting factor, i.e., as time 
t increases the effect of the old data at time t < t is discarded exponentially. 
The parameter 9(t) is to be chosen at each time t to minimize the integral 
square of the error on all past data that are discounted exponentially. 
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Using (4.3.57), we express (4.3.56) in terms of the parameter 9, i.e., 


m = 1 f e -K>- AANlTmi]l ir 

2 Jo mUr) 


(4.3.58) 


Clearly, J(6 ) is convex over the space of 9 for each time t and the application 
of the gradient method for minimizing J{9) w.r.t. 9 yields 


9 = -rv J = r 


a -p(t-T) ( Z ( T ) - 0 T (*)<Kt)) 

m 2 (r) 




(4.3.59) 


where T = T t > 0 is a scaling matrix that we refer to as the adaptive gain. 
Equation (4.3.59) is implemented as 


T(R(t)9 + Q(t)) 

„ 6d) T 

(JR + „ , 

J5 

^0 

11 

0 

m z 

0 

^0 

11 

0 


(4.3.60) 


where R € 7 Z nxn ,Q 6 IZ nxl . We refer to (4.3.59) or (4.3.60) as the integral 
adaptive law. Its form is different from that of the previous adaptive laws 
we developed. The properties of (4.3.60) are also different and are given by 
the following theorem. 


Theorem 4.3.3 The integral adaptive law (4-3.60) guarantees that 

(i) e,en s ,9,9 e Coo 

(ii) e,en s ,9eC- 2 
(hi) lim^oo |0(f)| = 0 

(iv) if n s ,<j) £ Coo and 4> is PE then 9(t) converges exponentially to 9*. 
Furthermore, for T = 7 / the rate of convergence can be made arbitrarily 
large by increasing the value of the adaptive gain 7 . 

Proof Because — £ C^, it follows that R,Q £ C x and, therefore, the differential 
equation for 6 behaves as a linear time- varying differential equation with a bounded 
input. Substituting for z = cjA 9* in the differential equation for Q we verify that 

Q(t) = - f e-^-C ^ T ^ T ^ dr9* = - R(t)9 * 

Jo ™ 


(4.3.61) 
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and, therefore, 

9 = 0 = -TR(t)0 

We analyze (4.3.62) by using the Lyapunov-like function 


V(0) 


0 T V~ 1 0 

2 


(4.3.62) 


(4.3.63) 


whose time derivative along the solution of (4.3.62) is given by 

V = -O 1 R(t)0 (4.3.64) 

Because R(t) = R T (t) > 0 Vf > 0 it follows that V < 0; therefore, V,0,9 G C x , 
(8 T R.O)^ = \R^0\ G £ 2 . From e = — and 0, y) G £oo we conclude that e, em 
and, therefore, en s G 

From (4.3.62) we have 

\0\ < ||ri?ii||f?50| (4.3.65) 

which together with R G C <*, and \R^0\ G f) £ 2 imply that 0 G f) £ 2 . Since 

0,R G £oo, it follows from (4.3.62) that 0 G which, together with 0 G £ 2 , 
implies lim^oo |0(t)| = lim^oo |r.R(t)0(t)| = 0. 

To show that em G £ 2 we proceed as follows. We have 


Therefore, 


4 - FRO = e 2 m 2 - 20 t RTR6 - B6 t R 0 
dt H 


2 2 
e rn 


dr = 0 t R6 + 2 / O'RTROdr + (3 / 0 T R0 dr 


Because lim t ^ oo [d T (f)l?(t)0(t)] = 0 and \R*0\ G £ 2 it follows that 


lim 

t — >-oo 



2_2j 

e m dr 



e 2 m 2 dr < oo 


i.e., em. G £ 2 . 

Hence, the proof for (i) to (iii) is complete. The proof for (iv) is given in Section 
4.8. □ 


Remark 4.3.14 In contrast to the adaptive law based on the instantaneous 
cost, the integral adaptive law guarantees that 9(t) = —TV — > 0 
as t. — > oo without any additional conditions on the signal vector 0 
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and m. In this case, 6{t) converges to a trajectory that minimizes 
the integral cost asymptotically with time. As we demonstrated in 
Chapter 3 using simple examples, the convergence of 6{t) to a zero 
vector does not imply that 6(t) converges to a constant vector. 

In the proof of Theorem 4.3.3 (iv) given in Section 4.8, we have estab- 
lished that when n s ,<j) € Too and <f> is PE, the parameter error 6 satisfies 

m\ < ^fW o)le^> vt > To 

where 

a = 2f5 1 e~ !3To X m i n (T), = a 0 T 0 a 0 , a 0 = sup 

t m z (t) 

and ao, To are constants in the definition of the PE property of (j>, i.e. , «o > 0 
is the level of excitation and To > 0 is the length of the time interval. The 
size of the constant a > 0 indicates the speed with which 1 9(t) \ is guaranteed 
to converge to zero. The larger the level of excitation ato, the larger the a is. 
A large normalizing signal decreases the value of a and may have a negative 
effect on the speed of convergence. If T is chosen as T = 7 1, then it becomes 
clear that a larger 7 guarantees a faster convergence of | 0 (t)| to zero. 

Example 4.3.3 Let us consider the same problem as in Example 4.3.1. We consider 
the equation 

z = W(s)Asin(u;t + </?) = W{s)0* T (/) (4.3.66) 

where 9* = [Ai, A 2 ] T ,(/> = [sinwt,coswt] T , A\ = Acos^p, A 2 = Asirn^. We need to 
estimate A , ip using the knowledge of <j>, u>, W(s) and the measurement of 2 . 

We first express (4.3.66) in the form of the linear parametric model (4.3.48) by 
filtering <j) with W(s), i.e., 

2 = r T 0 o , = W{s)(j> (4.3.67) 

and then obtain the adaptive law for estimating 9* by applying the results of this 
section. The gradient algorithm based on the instantaneous cost is given by 

9 = Te4> 0 , e= m 2 = 1 + (4.3.68) 

m z 


where 9 = is the estimate of 9* and a > 0. Because 0 G C 0 0 and W(s) 

has stable poles, </>o € and a can be taken to be equal to zero. 



A 

A 


9 

9 


A (f) sin (2f + (p (f) ) 
A sin (2t + cp) 


Figure 4.8 Simulation results for Example 4.3.3: Performance of the gradient 
adaptive law (4.3.68) based on the instantaneous cost. 

The gradient algorithm based on the integral cost is given by 

6 = -r(i?(t)6» + Q), 0 ( 0 ) = 9 0 

R = -/TR+^E, r(0) = 0 (4.3.69) 

m z 

Q = Q( o) = o 

where R £ lZ 2x2 ,Q £ r R? xl . The estimate A,ip of the unknown constants A,tp is 
calculated from the estimates A -[ , A-i in the same way as in Example 4.3.1. We can 
establish, as shown in Example 4.3.1, that <f>o satisfies the PE conditions; therefore, 
both adaptive laws (4.3.68) and (4.3.69) guarantee that 6 converges to 6* exponen- 
tially fast. 

Let us now simulate (4.3.68) and (4.3.69). We choose T = diag( 7 ) with 7 = 10 
for both algorithms, and /3 = 0.1 for (4.3.69). We also use u) = 2 rad/sec and 
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A (f) sin (2f + (p (f) ) 
A sin (2t + cp) 


Figure 4.9 Simulation results for Example 4.3.3: Performance of the gra- 
dient adaptive law (4.3.69) based on the integral cost. 

W(s) = Figure 4.8 shows the performance of the adaptive law (4.3.68) based 
on the instantaneous cost, and Figure 4.9 shows that of (4.3.69) based on the integral 
cost. V 


Remark 4.3.15 This example illustrates that for the same estimation prob- 
lem the gradient method leads to adaptive laws that require more 
integrators than those required by adaptive laws based on the SPR- 
Lyapunov design approach. Furthermore, the gradient algorithm based 
on the integral cost is far more complicated than that based on the 
instantaneous cost. This complexity is traded off by the better conver- 
gence properties of the integral algorithm described in Theorem 4.3.3. 
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4.3.6 Least-Squares 

The least-squares is an old method dating back to Gauss in the eighteenth 
century where he used it to determine the orbit of planets. The basic idea 
behind the least-squares is fitting a mathematical model to a sequence of 
observed data by minimizing the sum of the squares of the difference between 
the observed and computed data. In doing so, any noise or inaccuracies in 
the observed data are expected to have less effect on the accuracy of the 
mathematical model. 

The method of least-squares has been widely used in parameter esti- 
mation both in a recursive and nonrecursive form mainly for discrete-time 
systems [15, 52, 73, 80, 127, 144], The method is simple to apply and analyze 
in the case where the unknown parameters appear in a linear form, such as 
in the linear parametric model 


z = e* T (j) (4.3.70) 

Before embarking on the use of least-squares to estimate 6* in (4.3.70), let 
us illustrate its use and properties by considering the simple scalar plant 


y = 0*U + d n 


(4.3.71) 


where d n is a noise disturbance; y,u G IZ + and u G Coo- We examine 
the following estimation problem: Given the measurements of y{r),u{r) for 
0 < r < t, find a “good” estimate 9(t ) of 6* at time t. One possible solution 
is to calculate 9{t) from 


m 


y(i~) = ,* d n {r) 
u(t ) u(t) 


for some r < t for which u(t ) 7 ^ 0. Because of the noise disturbance, 
however, such an estimate may be far off from 0* . A more natural approach 
is to generate 6 by minimizing the cost function 

J{ 6 ) = \ Jo ~ 6 ^ u ( T ^ 2dT (4.3.72) 

with respect to 6 at any given time t. The cost J{6) penalizes all the past 
errors from r = 0 to t that are due to 9(t) / 9*. Because J{9) is a convex 
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function over 7Z 1 at each time t, its minimum satisfies 

VJ(0) = — I f ' dr + 9 (t) f u 2 (t)(It = 0 (4.3.73) 

Jo Jo 

for any given time t, which gives 

9(t) = (/ u 2 {t)(1t ^ J y{T)u{T)dr (4.3.74) 

provided of course the inverse exists. This is the celebrated least-squares 
estimate. The least-squares method considers all past data in an effort to 
provide a good estimate for 9* in the presence of noise d n . For example, 
when u{t) = 1 Vf > 0 and d n has a zero average value, we have 

lim 9(t) = lim - /'y(r)u(r)dr = 9* + lim - [ d n (r)dr = 9* 
t —> oo t-» oo t Jo °° t Jo 

i.e. , 9{t ) converges to the exact parameter value despite the presence of the 
noise disturbance d n . 

Let us now extend this problem to the linear model (4.3.70). As in 
Section 4.3.5, the estimate z of z and the normalized estimation error are 
generated as 



z — z 
m 2 


z — 9 T cj) 
m3 


(4.3.49) 


where m 2 = 1 + n 2 s , 9(t) is the estimate of 9* at time t, and m satisfies 


4>/m G Too. 

We consider the following cost function 




-p (t-r) - 9 ( t)<t>(T )] 

m 2 (r) 


dr + -e-P\9 


9o) T Qo(9 - 9 q) 


(4.3.75) 

where Qo = Qq > 0, (3 > 0, 9q = 0(0), which is a generalization of (4.3.72) 
to include discounting of past data and a penalty on the initial estimate 8q 
of 9*. The cost (4.3.75), apart from the additional term that penalizes the 
initial parameter error, is identical to the integral cost (4.3.58) considered 
in Section 4.3.5. The method, however, for developing the estimate 9(t) for 
9* is different. Because z/m , (j)/rn £ Too, J(9) is a convex function of 9 over 
IZ n at each time t. Hence, any local minimum is also global and satisfies 


VJ(9(t)) =0, Vt > 0 
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i.e., 

VJ(0) = e-^Qom) - d 0 ) - ~ ^dr = 0 

Jo m z (T) 

which yields the so-called nonrecursive least-squares algorithm 

0(t) = P(t ) le-^Qodo + f e-^- T ) (4.3.76) 

L Jo m z (r) 

where 

P{t) = \e~^Qo+ [* ^ di\ (4.3.77) 

Jo m z (T) 

Because Q o = Qq > 0 and 4>(j) T is positive semidefinite, P(t) exists at each 
time t. Using the identity 

— pp- 1 = Pp- 1 + p— p- 1 = 0 

dt dt 

we can show that P satisfies the differential equation 

P = PP - P^P, P{ 0) = P 0 = Qo 1 (4.3.78) 

m z 

Therefore, the calculation of the inverse in (4.3.77) is avoided by generating P 
as the solution of the differential equation (4.3.78). Similarly, differentiating 
9{t) w.r.t. t and using (4.3.78) and em 2 = z — 9 T (f>, we obtain 

9 = Pe4> (4.3.79) 

We refer to (4.3.79) and (4.3.78) as the continuous-time recursive least- 
squares algorithm with forgetting factor. 

The stability properties of the least-squares algorithm depend on the 
value of the forgetting factor (3 as discussed below. 

Pure Least-Squares 

In the identification literature, (4.3.79) and (4.3.78) with (3 = 0 is referred 
to as the “pure” least-squares algorithm and has a very similar form as the 
Kalman filter. For this reason, the matrix P is usually called the covariance 
matrix. 
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Setting p = 0, (4.3.78), (4.3.79) become 

0 = Pep 

Ppp T P „ N . . 

P = —~2 ? P(0) = P 0 4.3.80) 

m z 

In terms of the P~ 1 we have 

d_ p ~ i = 
dt m 2 

which implies that d ^ P dt ^ > 0, and, therefore, P~ l may grow without bound. 
In the matrix case, this means that P may become arbitrarily small and slow 
down adaptation in some directions. This is the so-called covariance wind-up 
problem that constitutes one of the main drawbacks of the pure least-squares 
algorithm. 

Despite its deficiency, the pure least-squares algorithm has the unique 
property of guaranteeing parameter convergence to constant values as de- 
scribed by the following theorem: 

Theorem 4.3.4 The pure least-squares algorithm (4-3.80) guarantees that 

(i) e,en s ,d,0,P e Too. 

(ii) e, en s , 0eC-2- 

(iii) lim^oo d(t) = 6, where 6 is a constant vector. 

(iv) If n s , p 6 Too and p is PE, then 9{t) converges to 0* as t — > oo. 

Proof From (4.3.80) we have that P < 0, i.e., P(t) < Pq. Because P(t) is nonin- 
creasing and bounded from below (i.e., P(t) = P T (t) > 0 ,Vt > 0) it has a limit, 
i.e., 

lim P(t) = P 

t—> OO 

where P = P T > 0 is a constant matrix. Let us now consider 

-f-(P _1 0) = -P^PP^O + P~ l 9 = +ep = 0 
dt m 2 

where the last two equalities are obtained by using 9 = 9 , ^P _1 = — p~ 1 pp~ l 
and e = Hence, P” 1 (t)0(t) = Pq^^O), and, therefore, 9{t) = 

P(<)P o _1 0( 0) and lirn^oo 9{t) = PP o _1 0( 0), which implies that Hindoo 9{t) = 9* + 
PP o _1 0(O) = 9. 



196 


CHAPTER 4. ON-LINE PARAMETER ESTIMATION 


Because P(t) < Pq and 9(t) = P(t)P 0 1 ^(0) we have 9,9 £ Coo, which, together 
with ^ £ Coo, implies that em = and e, en s £ Coo ■ Let us now consider the 

function 

vmJNCM 

The time derivative V of V along the solution of (4.3.80) is given by 


~ T 0 r dxb r 6 o 9 

V = e9 T (f> + = —e 2 m 2 


< 0 


2 m 2 ' ' 2 2 

which implies that V £ Coo, em G £ 2 ; therefore, e, en s £ £ 2 - From (4.3.80) we have 


\0\ < ||£|| — |em| 
m 


Because P, ^,em £ Coo and em £ £ 2 , we have 9 £ Coo f) £21 which completes the 
proof for (i), (ii), and (iii). The proof of (iv) is given in Section 4.8. □ 


Remark 4.3.16 

(i) We should note that the convergence rate of 0(t) to 6* in Theorem 4.3.4 
is not guaranteed to be exponential even when (j> is PE. As shown in the 
proof of Theorem 4.3.4 (iv) in Section 4.8, P(t), 0(t) satisfy 


P(t) 


m 


where m = sup t m 2 {t), i.e. , |(9(f)| is guaranteed to converge to zero with 
a speed of ’ . 

(ii) The convergence of 0(t) to 0 as t — ► 00 does not imply that 0(t) — ► 0 as 
t — ► 00 (see examples in Chapter 3). 

(iii) We can establish that e, 9 — > 0 as t — ► 00 if we assume that (j)/m , m/m £ 
Coo as in the case of the gradient algorithm based on the instantaneous 
cost. 


Pure Least-Squares with Covariance Resetting 

The so called wind-up problem of the pure least-squares algorithm is avoided 
by using various modifications that prevent P(t) from becoming singular. 
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One such modification is the so-called covariance resetting described by 

0 = Pe<t> 

Pdxb^ P , 

p = gnp p^ t + ) = PQ = pQl ( 4 . 3 . 81 ) 

rrP 

where t r is the time for which A min(P{t)) < P\ and po > p\ > 0 are some 
design scalars. Because of (4.3.81), P(t) > p\I Vi > 0; therefore, P is 
guaranteed to be positive definite for all t > 0. 

Strictly speaking, (4.3.81) is no longer the least-squares algorithm that 
we developed by setting VJ(0) = 0 and (3 = 0. It does, however, behave as 
a pure least-squares algorithm between resetting points. The properties of 
(4.3.81) are similar to those of the gradient algorithm based on the instan- 
taneous cost. In fact, (4.3.81) may be viewed as a gradient algorithm with 
time-varying adaptive gain P. 


Theorem 4.3.5 The pure least-squares with covariance resetting algorithm 
(4-3.81) has the following properties: 

(i) e,en s ,8,9 <E Coo. 

(ii) e,en s ,8<EC- 2 - 

(iii) If n s ,(j) G Coo and (j) is PE then 9(t) converges exponentially to 6*. 


Proof The covariance matrix P(t) has elements that are discontinuous functions 
of time whose values between discontinuities are defined by the differential equation 
(4.3.81). At the discontinuity or resetting point t r , P(f+) = Pq = pM\ therefore, 
P _1 (t+) = pf l I. Between discontinuities jlp _1 (t) > 0 , i.e., P~ l (t, 2 ) — P -1 (ti) > 0 
\/t '2 > t\ > 0 such that t r which implies that P -1 (f) > Pq 1 /, Vt > 0. 

Because of the resetting, P(t ) > p±I, Vt > 0. Therefore, (4.3.81) guarantees that 

Pol > P(t) > piT pf 1 ! > P _1 (f) > Pq 1 !, Vt > 0 


Let us now consider the function 

qT p— In 

V(d) = (4.3.82) 

where P is given by (4.3.81). Because P -1 is a bounded positive definite symmetric 
matrix, it follows that V is decrescent and radially unbounded in the space of 6. 
Along the solution of (4.3.81) we have 


V= -0 r 


djP- 1 ) 

dt 


i + 0 T P~ 1 e= — e 2 m 2 


2 dt 
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Between resetting points we have from (4.3.81) that 


AP- 1 ) = UL 

dt m 2 


; therefore, 


2 2 1 fi) 2 

V = —e 2 m 2 + - - — Ifi- 
2 m- 


< 0 


(4.3.83) 


Vt € [ti,t 2 ] where [tipfi is any interval in [0, 00 ) for which t r e/L 
At the points of discontinuity of P, we have 

v{tt) - v(t r ) = l -e T {p-\tt) - p-\t r ))~e 


Because P _1 (f+) = -LR P~ 1 (t r ) > -LR it follows that V(tfi) — V(t r ) < 0, which 
implies that V > 0 is a nonincreasing function of time for all t > 0. Hence, V £ Coo 
and Hindoo V(t) = Voo < 00 . Because the points of discontinuities t r form a set 
of measure zero, it follows from (4.3.83) that em,e € C 2 . From V £ Coo and 
Pi 1 ! > P ~ 1 (t) > Pq X I we have 6 £ Coo, which implies that e,em £ C 00 ■ Using 
em £ Coo 0 ^2 and pM > P > p±I we have 9 £ Coo f) U 2 and the proof of (i) and 
(ii) is, therefore, complete. 

The proof of (iii) is very similar to the proof of Theorem 4.3.2 (iii) and is 
omitted. □ 


Modified Least-Squares with Forgetting Factor 

When (3 > 0, the problem of P(t) becoming arbitrarily small in some direc- 
tions no longer exists. In this case, however, P(t) may grow without bound 
since P may satisfy P > 0 because (3P > 0 and the fact that p< ^ 2 p is only 
positive semidefinite. 

One way to avoid this complication is to modify the least-squares algo- 
rithm as follows: 


0 

P 



if l|e(t)|| < Rq 

otherwise 


(4.3.84) 


where P{ 0) = Pq = Pq > 0, ||P 0 || < Rq and Rp is a constant that serves 
as an upper bound for ||P||. This modification guarantees that P G C^ 
and is referred to as the modified least-squares with forgetting factor. The 
above algorithm guarantees the same properties as the pure least-squares 
with covariance resetting given by Theorem 4.3.5. They can be established 
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by choosing the same Lyapunov-like function as in (4.3.82) and using the 
identity d ^ = —P~ 1 PP~ 1 to establish 


dP - 1 
dt 


-PP - 1 + ^ if ||P|| < Ro 

0 otherwise 


where P 1 (0) = P 0 , which leads to 


V = 


Af - § e T p~ l e if ||p|| < R 0 


otherwise 


• 2 2 

Because V < < 0 and P(t) is bounded and positive definite Vi > 0, 

the rest of the analysis is exactly the same as in the proof of Theorem 4.3.5. 


Least-Squares with Forgetting Factor and PE 

The covariance modifications described above are not necessary when n s , 
4> £ Too an d is PE- The PE property of <f> guarantees that over an interval of 
time, the integral of -PPjA-P is a negative definite matrix that counteracts 
the effect of the positive definite term (IP with /? > 0 in the covariance 
equation and guarantees that P £ Coo- This property is made precise by the 
following corollary: 

Corollary 4.3.2 Ifn s ,(j> £ Coo and p is PE then the recursive least-squares 
algorithm with forgetting factor (3 > 0 given by (4-3.78) and (4-3.79) guar- 
antees that P, P -1 £ Coo and that 6(f) converges exponentially to 9*. 

The proof is presented in Section 4.8. 

The use of the recursive least-squares algorithm with forgetting factor 
with cj) £ Coo an d f PE is appropriate in parameter estimation of stable 
plants where parameter convergence is the main objective. We will address 
such cases in Chapter 5. 

Let us illustrate the design of a least-squares algorithm for the same 
system considered in Example 4.3.1. 

Example 4.3.4 The system 


2 = lT(s)Asin(u4 + ip) 
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where A, p are to be estimated on-line is rewritten in the form 

2 = 0* T fo 


where 9* = [Ai, A 2 ] T , = W(s)<j> — [sinwf,coswf] T . The least-squares algo- 

rithm for estimating 9* is given by 

9 = Pe<t> o 

P = pP-P^fp, P(0) = p 0 I 

m z 

where e = , m 2 = 1 + <f$ po and P > 0, po > 0 are design constants. Because 

(f > o is PE, no modifications are required. Let us simulate the above scheme when 
A = 10 ,ip= 16° = 0.279 rad ,u> = 2 rad/sec, W{s) = Figure 4.10 gives the 
time response of A and ip, the estimate of A and tp, respectively, for different values 
of p. The simulation results indicate that the rate of convergence depends on the 
choice of the forgetting factor p. Larger P leads to faster convergence of A, (p to 
A = 10, p = 0.279, respectively. V 


4.3.7 Effect of Initial Conditions 

In the previous sections, we developed a wide class of on-line parameter 
estimators for the linear parametric model 

z = W(s)0* T iP + r) o (4.3.85) 

where rjo, the exponentially decaying to zero term that is due to initial 
conditions, is assumed to be equal to zero. As shown in Chapter 2 and in 
Section 4.3.3, ij q satisfies the equation 


<ho — A c wo, <^o(0) — B 0 x 0 

Vo = Cq uj 0 (4.3.86) 


where A c is a stable matrix, and xq is the initial value of the plant state at 
t = 0. 



<p 

9 


A ( t ) sin (2 1 + (p (f) ) 
A sin (2t + cp) 


Figure 4.10 Simulation results of Example 4.3.4 for the least-squares algo- 
rithm with forgetting factor and PE signals. 

Let us analyze the effect of 77 0 on the gradient algorithm 
9 = Te<j> 

e = — z = 6 J cj) (4.3.87) 

rri z 

<!> = W{s)4 ; 

that is developed for the model (4.3.85) with 770 = 0 in Section 4.3.5. 

We first express (4.3.87) in terms of the parameter error 9 = 6 — 9*, i.e. , 

= re^ 



9 
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z — z — 9 t 4> + r/o 


(4.3.88) 


It is clear that r/o acts as a disturbance in the normalized estimation er- 
ror and, therefore, in the adaptive law for 6. The question that arises now 
is whether r / q will affect the properties of (4.3.87) as described by Theo- 
rem 4.3.2. We answer this question as follows. 

Instead of the Lyapunov-like function 

V® - ^ 

used in the case of r/o = 0 , we propose the function 


V(6,u 0 ) = 


6 t T~ 1 6 


+ <^o -fWo 


where Pq = Pj > 0 satisfies the Lyapunov equation 

PoA c + AjTb = — 7o-f 

for some 70 > 0 to be chosen. Then along the solution of (4.3.87) we have 
V = 0 T 4 >e - 7 o|w 0 | 2 = —e 2 m 2 + er/ 0 - 7 o|^o | 2 
Because r/o = CqUq we have 

V < -e 2 m 2 + |e| |w 0 | - 7o|w 0 | 2 

< _e 2 m 2 1 (_ ,, i| 2 / 7 ,_ IC'o’I 2 ^ 




I c T I 2 

By choosing 70 > % we have 


V< — 


which implies that 0 G , em G £2 • Because r/o £ £00 D £2 and f n G £00 
we have e, em,0 G £oo D £ 2 - Hence, (i) and (ii) of Theorem 4.3.2 also hold 
when j/o / 0. In a similar manner we can show that r/o 7 ^ 0 does not affect 
(iii) of Theorem 4.3.2. As in every dynamic system, r/o 7 ^ 0 will affect the 
transient response of 6(t) depending on how fast r/o (t) — ► 0 as t — > 00 . 

The above procedure can be applied to all the results of the previous sec- 
tions to establish that initial conditions do not affect the established prop- 
erties of the adaptive laws developed under the assumption of zero initial 
conditions. 
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4.4 Adaptive Laws with Projection 

In Section 4.3 we developed a wide class of adaptive laws for estimating the 
constant vector 9* that satisfies the linear parametric model 

2 = W(s)d* T ip (4.4.1) 

by allowing 9* to lie anywhere in IZ n . In many practical problems where 9* 
represents the parameters of a physical plant, we may have some a priori 
knowledge as to where 6* is located in lZ n . This knowledge usually comes in 
terms of upper or lower bounds for the elements of 9* or in terms of a well- 
defined subset of 7 Z n , etc. One would like to use such a priori information 
and design adaptive laws that are constrained to search for estimates of 
9* in the set where 9* is located. Intuitively such a procedure may speed 
up convergence and reduce large transients that may occur when 0(0) is 
chosen to be far away from the unknown 9*. Another possible reason for 
constraining 0{t) to lie in a certain set that contains 9* arises in cases where 
9{t) is required to have certain properties satisfied by all members of the set 
so that certain desired calculations that involve 9{t) are possible. Such cases 
do arise in adaptive control and will be discussed in later chapters. 

We examine how to modify the adaptive laws of Section 4.3 to handle 
the case of constrained parameter estimation in the following sections. 

4.4.1 Gradient Algorithms with Projection 

Let us start with the gradient method where the unconstrained minimization 
of J(9) considered in Section 4.3.5 is extended to 

minimize J(9) 

subject to 9 £ S (4.4.2) 

where S is a convex set with a smooth boundary almost everywhere. Let S 
be given by 

5 = {9 e 1Z n \ g{9) < 0} (4.4.3) 

where g : 7Z n i— > R is a smooth function. The solution of the constrained 
minimization problem follows from the gradient projection method discussed 
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in Appendix B and is given by 

r -rv J if 0 £ S° 

0 = Pr(-rVJ) = < or # e S(S) and -{TVJ) T \/g<0 

[ -rvj + r ^^ rvj otherwise 

(4.4.4) 

where S° is the interior of S, 5(S) is the boundary of S and 0(0) is chosen 
to be in S, i.e. , 0(0) £ S. 

Let us now use (4.4.4) to modify the gradient algorithm 0 = — rVJ(0) = 
Tecj) given by (4.3.52). Because VJ = —ecj), (4.4.4) becomes 


0 = Pr(re^.) 


Te<f> 


if 0 £ 5° 

or if 0 £ 5(S) and (Te</>) T Vg < 0 
otherwise 

(4.4.5) 


where 0(0) £ S. 

In a similar manner we can modify the integral adaptive law (4.3.60) by 
substituting VJ = R(t)6 + Q(t) in (4.4.4). 

The principal question we need to ask ourselves at this stage is whether 
projection will destroy the properties of the unconstrained adaptive laws 
developed in Section 4.3.5. This question is answered by the following The- 
orem. 


Theorem 4.4.1 The gradient adaptive laws of Section 4-3.5 with the pro- 
jection modification given by (4-4-4) retain all their properties that are estab- 
lished in the absence of projection and in addition guarantee that 6 £ S Vf > 
0 provided 0(0) = 0q £ S and 9* £ S. 


Proof It follows from (4.4.4) that whenever 9 £ S(S) we have 0 T V<? < 0, which 
implies that the vector 6 points either inside S or along the tangent plane of 6(S) 
at point 6 . Because 0(0) = 0o € S, it follows that 0(f) will never leave S , i.e., 
0(f) € S Vf > 0. 

The adaptive law (4.4.4) has the same form as the one without projection except 
for the additional term 


Q = 


r^^rvj if 0 £ S(S) and -(rvj) T v ff > 0 
0 otherwise 


in the expression for 0. If we use the same function V as in the unconstrained case 
to analyze the adaptive law with projection, the time derivative V of V will have 



4.4. ADAPTIVE LAWS WITH PROJECTION 


205 


the additional term 


Ft-'q 



e T N^_YVJ if 9 G S(S) and -(rVJ) T Vg > 0 
0 otherwise 


Because of the convex property of S and the assumption that 9* G S , we have 
FVg = (9 - 9*) T Vg > 0 when 9 G S(S). Because V 5 T rVJ=(rVJ) T Vg<0 
for 9 G S(S) and — (rVJ) T Vg > 0, it follows that 9 t T~ 1 Q < 0. Therefore, the 
term 9 t T~ 1 Q introduced by the projection can only make V more negative and 
does not affect the results developed from the properties of V. V. Furthermore, 
the £2 properties of 9 will not be affected by projection because, with or without 
projection, 9 can be shown to satisfy 

\9\ 2 < c|rvj| 2 


for some constant c G 1Z + . 


□ 


The projection modification (4.4.4) holds also for the adaptive laws based 
on the SPR-Lyapunov design approach even though these adaptive laws are 
not derived from the constrained optimization problem defined in (4.4.2). 
The reason is that the adaptive law (4.3.35) based on the SPR-Lyapunov 
approach and the gradient algorithm based on the instantaneous cost have 
the same form, i.e., 

9 = Tecfr 

where e, 4> are, of course, not the same signals in general. Therefore, by 
substituting for — TVJ = Tecj) in (4.4.4), we can obtain the SPR-Lyapunov 
based adaptive law with projection. 

We can establish, as done in Theorem 4.4.1, that the adaptive laws based 
on the SPR-Lyapunov design approach with the projection modification re- 
tain their original properties established in the absence of projection. 

Remark 4.4.1 We should emphasize that the set S is required to be convex. 
The convexity of S helps in establishing the fact that the projection 
does not alter the properties of the adaptive laws established without 
projection. Projection, however, may affect the transient behavior of 
the adaptive law. 


Let us now consider some examples of constrained parameter estimation. 
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Example 4.4.1 Consider the linear parametric model 

^ = e* r (/) 

where 9* = [91,9^ ■ ■ ■ , 0*] T is an unknown constant vector, 9\ is known to satisfy 
|0*| > po >0 for some known constant po, sgn(0*) is known, and z,(j> can be 
measured. We would like to use this a priori information about 9 £ and constrain 
its estimation to always be inside a convex set S which contains 9* and is defined 
as 

S={9eK n \ g(9) = po — 6»isgn(0J) < 0} 

The gradient algorithm with projection becomes 

( Te<j) if po - 0isgn(0*) < 0 

9=1 t or if po — 0iSgn(0*) = 0 and (Te</>) T V<? < 0 (4.4.6) 

Te<j) — T Yecj) otherwise 

where e = z ~^ 2 ^ and 0i(O) satisfies p 0 — 0i(O)sgn(0*) < 0. For simplicity, let us 
choose r = diag{ji,To} where 71 > 0 is a scalar and To = Tj >0, and partition 
(j>,9 as <j) = [0i,0j] T , 9 = [0i,0(]~] T where € TZ 1 . Because 

V<7 = [— sgn(0*),O, . . . , 0] T 


it follows from (4.4.6) that 


9i = 


71^1 


0 


0o = Toe^o 


if 6» 1 sgn(6»i) > po or 

if 6»isgn(6»i) = po and 7i</>iesgn(0j;) > 0 

otherwise 


where 0o(O) is arbitrary and 0i(O) satisfies 0i(O)sgn(0j;) > p 0 . 


(4.4.7) 

V 


4.4.2 Least-Squares with Projection 

The gradient projection method can also be adopted in the case of the least 
squares algorithm 

6 = Pec/) 

P = PP-P^P, P(0) = P 0 = Qc 1 (4.4.8) 

developed in Section 4.3.6 by viewing (4.4.8) as a gradient algorithm with 
time varying scaling matrix P and ecj) as the gradient of some cost function 
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J. If S = {8 £ IZ n | g(8) < 0} is the convex set for constrained estimation, 
then (4.4.8) is modified as 

r p ^ 

8 = Pr (Pe</>) = l 

{ p* - p J0r, p ^ 

where 8(0) € S and 

f pp - P^-P if 8eS° 

P = < m or if 0 e (5(5) and (Pe</>) T V 5 < 0 (4.4.10) 

[ 0 otherwise 

where P( 0) = Pq = Pq > 0. 

It can be shown as in Section 4.4.1 that the least-squares with projection 
has the same properties as the corresponding least-squares without projec- 
tion. 

The equation for the covariance matrix P is modified so that at the point 
of projection on the boundary of S, P is a constant matrix, and, therefore, 
the adaptive law at that point is a gradient algorithm with constant scal- 
ing that justifies the use of the gradient projection method explained in 
Appendix B. 

Example 4.4.2 Let us now consider a case that often arises in adaptive control 
in the context of robustness. We would like to constrain the estimates 9(t) of 0* to 
remain inside a bounded convex set S. Let us choose S as 

5= [6 G K n \8 T 8-Ml < 0} 

for some known constant Mq such that \9*\ < Mq. The set S represents a sphere in 
1Z n centered at 9 = 0 and of radius M 0 . We have 


if 8 e 5° 

or if 8 G <5 (5) and (PecjT^Vg < 0 
otherwise 

(4.4.9) 


Vg = 29 


and the least-squares algorithm with projection becomes 


0 = 


Pecf) if 9 t 9 < Mq 

or if 6 t 9 = Mq and (Pe(f>) T 9 < 0 

— gTpg ^j Pe(t> otherwise 


(4.4.11) 
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where 0(0) satisfies 0 T (0)0(0) < M§ and P is given by 
( PP-P^P if 0 T 0 < Mq 

P= < m or if 0 T 0 = Mq and (Pe^) T 0 < 0 (4.4.12) 

[ 0 otherwise 

Because P = P T > 0 and 9 T P9 > 0 when 0 T 0 = Mq, no division by zero occurs in 
(4.4.11). V 


4.5 Bilinear Parametric Model 

As shown in Chapter 2, a certain class of plants can be parameterized in 
terms of their desired controller parameters that are related to the plant 
parameters via a Diophantine equation. Such parameterizations and their 
related estimation problem arise in direct adaptive control, and in particular, 
direct MRAC, which is discussed in Chapter 6. 

In these cases, 9*, as shown in Chapter 2, appears in the form 

z = W(s)[p*(6* T 'iJ; + z 0 )\ (4.5.1) 

where p* is an unknown constant; z,ip,zo are signals that can be measured 
and W(s) is a known proper transfer function with stable poles. Because 
the unknown parameters p * , 9* appear in a special bilinear form, we refer to 
(4.5.1) as the bilinear parametric model. 

The procedure of Section 4.3 for estimating 9* in a linear model extends 
to (4.5.1) with minor modifications when the sgn(p*) is known or when 
sgn(p*) and a lower bound po of \p*\ are known. When the sgn(p*) is un- 
known the design and analysis of the adaptive laws require some additional 
modifications and stability arguments. We treat each case of known and 
unknown sgn(p*),/9o separately. 

4.5.1 Known Sign of p* 

The SPR-Lyapunov design approach and the gradient method with an in- 
stantaneous cost function discussed in the linear parametric case extend to 
the bilinear one in a rather straightforward manner. 
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Let us start with the SPR-Lyapunov design approach. We rewrite (4.5.1) 
in the form 

z = W{s)L(s)p*{6* T (j) + z i) (4.5.2) 

where z\ = L~ 1 (s)zo, (j) = L^ 1 (s)'ijj and L(s) is chosen so that L~ 1 (s) is 
proper and stable and WL is proper and SPR. The estimate z of z and the 
normalized estimation error are generated as 

z = W (s)L(s)p(6 T (j) + Zl ) (4.5.3) 

e = z — z — W (s)L(s)e n 2 (4.5.4) 

where n s is designed to satisfy 

— , — € Coo, m 2 = 1 + n 2 (A2) 

m m 

and p(t),6(t) are the estimates of p*,9* at time t, respectively. Letting 
p = p — p* , 6 = 9 — 9*, it follows from (4.5.2) to (4.5.4) that 

e = W(s)L(s)[p*9* T (f> — pz\ — p9 T — en 2 ] 

Now p*e* T 4> - P 9 T $ = p*9* T cj) - p*9 T 0 + p*9 T 4> - p# T </> = -p*0 T </> - p9 T (j) 
and, therefore, 

e = W(s)L(s)[-p*9 T (j) -~pi- en 2 ], £ = 9 T $ + z i 

A minimal state representation of (4.5.5) is given by 

e = A c e + B c (—p*9 T cf> - p£ - en 2 ) 
e = Cje 


(4.5.5) 

(4.5.6) 


where Cj ( si — A c ) l B c = W(s)L(s) is SPR. The adaptive law is now de- 
veloped by considering the Lyapunov-like function 


V(9,p) 


e T P c e 

2 


Ip* 


IT T-l-1 


9 p 2 

+ 

2 2y 


where P c = Pj > 0 satisfies the algebraic equations given by (4.3.32) that 
are implied by the KYL Lemma, and T = T t > 0 , 7 > 0. Along the solution 
of (4.5.6), we have 


V = — 


e T qq T e 


- -e T L c e - p* e9 1 cj) - epi 


e 2 n 2 s + \p*\0 T r~ 1 d + 


PP 

7 


2 
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where v > 0, L c = Lj > 0. Because p* = |p*|sgn(p*) it follows that by 
choosing 


0 = 9 = re0sgn(/?*) 
p = p = 1 e£ 


we have 


V = — 


e T qq T e 


— -e T L c e — e 2 n 2 s < 0 
2 s - 


The rest of the analysis continues as in the case of the linear model. We sum- 
marize the properties of the bilinear adaptive law by the following theorem. 


Theorem 4.5.1 The adaptive law (4-5.7) guarantees that 

(i) e,0,p£C oo- 

(ii) e,en s ,0,p G C 2 - 

(iii) If 4>,4> £ Coo, is PE and £ £ £ 2 , then Oft) converges to 0* as t — ► 00 . 

(iv) If £ £ C 2 , the estimate p converges to a constant p independent of the 
properties of f. 

Proof The proof of (i) and (ii) follows directly from the properties of V. V by 
following the same procedure as in the linear parametric model case and is left as 
an exercise for the reader. The proof of (iii) is established by using the results of 
Corollary 4.3.1 to show that the homogeneous part of (4.5.6) with p£ treated as 
an external input together with the equation of 0 in (4.5.7) form an e.s. system. 
Because p£ £ £2 and A c is stable, it follows that e, 6 — > 0 as t — > 00 . The details of 
the proof are given in Section 4.8. The proof of (iv) follows from e, £ £ £2 and the 
inequality 


, , 1 
pt pt / pOO \ 2 / poo 

/ \p\dr < 7 / |e£|dT < 7 ( / e 2 dr ) ( / f 2 dr 


< 00 


Jo Jo \J 0 / VO / 

which implies that p £ C\. Therefore, we conclude that p(t) has a limit p, i.e., 

liirp^oo p{t) = p. □ 


The lack of convergence of p to p* is due to £ £ £ 2 - If, however, (j>, £ 
are such that <f> a = [<?i T ,£] T is PE, then we can establish by following the 
same approach as in the proof of Corollary 4.3.1 that 9,p converge to zero 
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exponentially fast. For £ G C 2 , the vector <p a cannot be PE even when cf) is 
PE. 

For the gradient method we rewrite (4.5.1) as 

z = p*(e* T cj) + Zl ) (4.5.8) 


where z\ = W(s)zo,(f> = W(s)il>. Then the estimate z of z and the normal- 
ized estimation error e are given by 


z = p(8 T 4> + zi) 

z — z z — p(0 T cj) + z\) 


where n 2 is chosen so that 

0 Z 1 r 2 _ 1 2 

— , — E H 00 1 wi — 1 + 
m m 

As in the case of the linear model, we consider the cost function 


(4.5.9) 


(A2) 


J(p,9) 


e 2 m 2 


2 


(z- p*d T <j)- p£ + p*Z- p*z 1) 2 
2 m 2 


where £ = 8 T (f>+ z\ and the second equality is obtained by using the identity 
— p(0 T cj) + z\) = — p£ — p*0 T cj) + p*^ — p* zi. Strictly speaking J(p,0) is not a 
convex function of p. 9 over IZ n+1 because of the dependence of £ on 9. Let 
us, however, ignore this dependence and treat £ as an independent function 
of time. Using the gradient method and treating £ as an arbitrary function 
of time, we obtain 

6 = Tip*e<t>, p = ye£ (4.5.10) 

where Ti = > 0,7 > 0 are the adaptive gains. The adaptive law (4.5.10) 

cannot be implemented due to the unknown p* . We go around this difficulty 
as follows: Because Ti is arbitrary, we assume that Pi = for some other 
arbitrary matrix T = T t > 0 and use it together with p* = |/o*|sgn(p*) to 
get rid of the unknown parameter p*, i.e. , Tip* = r 77 P* = Tsgn(p*) leading 
to 

0 = Te(j)Sgn(p*), p = ye£ (4.5.11) 

which is implement able. The properties of (4.5.11) are given by the following 
theorem. 
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Theorem 4.5.2 The adaptive law (4-5.11) guarantees that 

(i) e,en s ,6,6,p,p e Coo- 

(ii) e,en s ,9,p £ C 2 . 

(iii) If n s , cj) £ Too, </> is PE and £ £ C 2 , then 9(t) converges to 9* as t.—> 00 . 

(iv) If £ £ £ 2 ; t/ien p converges to a constant p as t — > 00 independent of 
the properties of 4>. 

The proof follows from that of the linear parametric model and of The- 
orem 4.5.1, and is left as an exercise for the reader. 

The extension of the integral adaptive law and least-squares algorithm to 
the bilinear parametric model is more complicated and difficult to implement 
due to the appearance of the unknown p* in the adaptive laws. This problem 
is avoided by assuming the knowledge of a lower bound for \p*\ in addition 
to sgn(p*) as discussed in the next section. 

4.5.2 Sign of p* and Lower Bound p 0 Are Known 

The complications with the bilinearity in (4.5.1) are avoided if we rewrite 
(4.5.1) in the form of the linear parametric model 

z = 9* T (j) (4.5.12) 

where 9* = \9\,9(^} T ,(() = [^ 1 , 0 T ] T , and 9\ = p*,9^ = p*9*. We can now 
use the methods of Section 4.3 to generate the estimate 9(t) of 9* at each 
time t. From the estimate 9 = [9\ , of 9*. we calculate the estimate p, 9 
of p* , 9* as follows: 

p(t) = 9\ (l). 9(1.) = |jj|| (4.5.13) 

The possibility of division by zero or a small number in (4.5.13) is avoided 
by constraining the estimate of 9\ to satisfy |^i(t)| > po > 0 for some 
Po < |p* |- This is achieved by using the gradient projection method and 
assuming that po and sgn(p*) are known. We illustrate the design of such a 
gradient algorithm as follows: 

By considering (4.5.12) and following the procedure of Section 4.3, we 
generate 

z — z 

m 2 


z = 9 T (j), e 


(4.5.14) 
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where m 2 = 1 + n 2 and n s is chosen so that (p/rn G e.g. n 2 = (p T cp. The 
adaptive law is developed by using the gradient projection method to solve 
the constrained minimization problem 


min J(9) 
e 


- (z 

= min — 
e 


- e T cp) 2 
2 m 2 


subject to pQ — 0 isgn(p*) < 0 

i.e., 

(Veep if Po — ^isgn(/?*) < 0 

Q = J or if po — 0 isgn(/ 9 *) = 0 and ( Fe(p) T \7g < 0 

l Te d ~ T VnTVQ Te ^ otherwise 

(4.5.15) 

where g(9 ) = po — 0isgn(p*). For simplicity, let us assume that V = 
diag{'yi,T 2 } where 71 > 0 is a scalar and r 2 = Tj >0 and simplify the 
expressions in (4.5.15). Because 

V <7 = [— sgn(p*), 0, . . . , 0] T 


it follows from (4.5.15) that 

{ 7ie^i if 0isgn(p*) > p 0 

or if 0isgn( / o*) = po and — 71(^1 esgn(p*) < 0 (4.5.16) 

0 otherwise 

where $i(0) satisfies 0i(O)sgn(p*) > po, and 

0 2 = r 2 e</> 2 (4.5.17) 

where (pi = z\, (p 2 = (p- 

Because 6\{t) is guaranteed by the projection to satisfy |$i(f)| > po > 0, 
the estimate p(t), 9(t) can be calculated using (4.5.13) without the possibility 
of division by zero. The properties of the adaptive law (4.5.16), (4.5.17) with 
(4.5.13) are summarized by the following theorem. 


Theorem 4.5.3 The adaptive law described by (4-5.13), (4-5.16), (4-5.17) 
guarantees that 
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(i) e, en s ,p, 9,'p, 8 £ C^. 

(ii) e,en s ,p,9 £ C 2 . 

(iii) Ifn s ,(f> £ Coo and f is PE, then 9,6, p converge to 9*,6*,p*, respec- 
tively, exponentially fast. 

Proof Consider the Lyapunov-like function 

~e[ f 2 Tf% 

271 2 

where 9i = 9\ — 9\, 9 2 = 9 2 — 0 2 . Then along the solution of (4.5.16), (4.5.17), we 
have 

! -e 2 ?n 2 if 0isgn(p*) > p 0 

or if 0isgn (p*) = p 0 and -7i</qesgn(p*) < 0 (4.5.18) 

0 2 f> 2 e if 0isgn(p*) = p 0 and -7i^iesgn(p*) > 0 

Because em 2 = —9 (j> = —9if>i — 9 2 we have 9 2 (j> 2 e = — e 2 m 2 — 9±e(j)i. For 
0isgn(p*) = p 0 (he., 9 X = p 0 sgn(p*)) and -7i0iesgn(p*) > 0, we have (j^e = 
(PoSgn(p*) - |p*|sgn(p*))0!e = (p 0 - |p*|)sgn(p*)^ 1 e > 0 (because p 0 ~\p*\ <0 and 
sgn(p*)<^ie < 0), which implies that 

e~9 2 (j> 2 = -e 2 m 2 - (p 0 - \p* |)sgn(p*) < -e 2 m 2 

Therefore, projection introduces the additional term — e</>i(po — |p*|)sgn(p*) that 

can only make V more negative. Substituting for e9 2 <p 2 < — e 2 m 2 in (4.5.18) we 
have 

V < -e 2 m 2 

which implies that 9\, 9 2 £ Coo', C en s G £oo fj C 2 . 

Because <f> / rn € C^ and em £ CooC\C 2 , it follows from (4.5.16), (4.5.17) that 
9i £ Coo f) C 2 , i = 1,2. 

Using (4.5.13), we have p = 9\, 9 = |^ — which, together with 9i £ 
Coo fl C 2 , i — 1,2, |0i| > p 0 > 0, imply that p,9 £ Coo[\C 2 . 

The convergence of 9 to 9* follows from that of Theorem 4.3.2 (iii) and the fact 
that projection can only make V more negative. The convergence of 9,p to 9 * , p* 
follows from that of 9*, equation (4.5.13), assumption |0*| > p 0 and |0i(t)| > po 
Vt > 0. □ 


In a similar manner one may use (4.5.12) and (4.5.13) to derive adaptive 
laws using the integral cost function and least-squares with 0\ constrained 
to satisfy |$i(t)| > po > 0 Vt > 0. 
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4.5.3 Unknown Sign of p* 

The problem of designing adaptive laws for the bilinear model (4.5.1) with 
sgn(p*) unknown was motivated by Morse [155] in the context of MRAC 
where such an estimation problem arises. Morse conjectured that the sgn(p*) 
is necessary for designing appropriate adaptive control laws used to solve 
the stabilization problem in MRAC. Nussbaum [179] used a simple example 
to show that although Morse’s conjecture was valid for a class of adaptive 
control laws, the sgn(p*) is not necessary for stabilization in MRAC if a 
different class of adaptive or control laws is employed. This led to a series of 
results on MRAC [137, 156, 157, 167, 236] where the sgn(p*) was no longer 
required to be known. 

In our case, we use the techniques of [179] to develop adaptive laws for 
the bilinear model (4.5.1) that do not require the knowledge of sgn(/ 9 *). The 
design and analysis of these adaptive laws is motivated purely from stability 
arguments that differ from those used when sgn(p*) is known. 

We start with the parametric model 

2 = p*(0* T cf> + zi ) 

and generate z, e as 

z = N (x) p(9 r cf> + zi) , e =- — 

m z 

where m 2 = 1 + n 2 is designed so that ^ ^ , 

N(x) = x 2 cosx (4.5.19) 

x is generated from 

0 2 

x = w +— ; w = e 2 m 2 , w(0) = 0 (4.5.20) 

2 7 

where 7 > 0. The following adaptive laws are proposed for generating p, 6: 

6 = N(x)Fe<j>, p = N{x)-je£ (4.5.21) 

where £ = 9 T (j) + z\ . The function N(x) plays the role of an adaptive gain 
in (4.5.21) and has often been referred to as the Nussbaum gain. Roughly 
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speaking, N(x) accounts for the unknown sign of p* by changing the sign of 
the vector field of 6, p periodically with respect to the signal x. 

The design of (4.5.19) to (4.5.21) is motivated from the analysis that is 
given in the proof of the following theorem, which states the properties of 
(4.5.19) to (4.5.21). 

Theorem 4.5.4 The adaptive law (4-5.19) to (4-5.21) guarantees that 

(i) x, w, 9, p e Coo 

(ii) e, en s ,6,p G C oc f]C- 2 

Proof We start by expressing the normalized estimation error e in terms of the 
parameter error 0, i.e. , 


era = z — z 


= p*e* T ct> + p*z i - p*0 T c( - p*z i + p*( - N{x)p( 
= -p*0 T (t> + p*Z- N(x)p( 


(4.5.22) 


We choose the function 


whose time derivative along the solution of (4.5.21) is given by 

V = 9 T cj)eN(x) 


(4.5.23) 


Substituting 9 T </> = T[—em 2 + p*( — N(x)p(\ from (4.5.22) into (4.5.23), we obtain 
V = - P*e( + N(x)pe(] 


• n( x) r o 2 pp 1 p 
V = + — + - 

P* L 7 J 7 

From (4.5.20), we have x = e 2 m 2 + — which we use to rewrite V as 


V = — 


N(x)x p 
P* + 7 


(4.5.24) 


Integrating (4.5.24) on both sides we obtain 


V(t) - T(0) = p(t) m - -i r N(a)dc 
7 P Jo 
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Because N(x) = x 2 cos x and a 2 cos a da = 2x cos x + (x 2 — 2) sin x , it follows 
that 

V(t) — V (0) = ^ — [2x cos x + (x 2 — 2) sin x\ (4.5.25) 

7 P* 

2 

From x = w + ^ and w > 0, we conclude that x(t) > 0. Examination of (4.5.25) 
shows that for large x, the term a; 2 sin a; dominates the right-hand side of (4.5.25) 
and oscillates between — x 2 and a; 2 . Because 


V(t) 


E(0) + 


p{t) - p{ 0) 

7 


1 .. „ . , a: 2 sin x 

— 2a; cos x — 2 sin x\ 

p* p* 


and V (t) > 0, it follows that x has to be bounded, otherwise the inequality V > 0 
will be violated for large x. Bounded x implies that V, w,p,6 £ £oo- Because w(t) = 
f q e 2 m 2 dt is a nondecreasing function bounded from above, the lim^oo w(t) = 
e 2 m 2 dt exists and is finite which implies that em £ £ 2 , i-e., e, en s £ £ 2 - The 
rest of the proof follows directly as in the case of the linear parametric model and 
is omitted. □ 


4.6 Hybrid Adaptive Laws 

The adaptive laws developed in Sections 4.3 to 4.5 update the estimate 9(t) 
of the unknown parameter vector 9* continuously with time, i.e., at each 
time t we have a new estimate. For computational and robustness reasons it 
may be desirable to update the estimates only at specific instants of time ty. 
where is an unbounded monotonically increasing sequence in IZ + . Let 
tk = kT s where T s = t^ + \ — is the “sampling” period and k = 0, 1, 2, ... ,. 
Consider the design of an adaptive law that generates the estimate of the 
unknown 9* at the discrete instances of time t = 0, T s . 2 T s , 

We can develop such an adaptive law for the gradient algorithm 

9 = Tecj) (4.6.1) 

z — z 

6 = 2 ~ 

m- 

where z is the output of the linear parametric model 


(4.6.2) 
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and 

z = O t 4> (4.6.3) 

Integrating (4.6.1) from t k = kT s to t k +i = (k + 1 )T S we have 

rtk+i 

0 k+ i = dk + T e(r)0(r)dr, 6 0 = 0( 0), k = 0,1,2,... (4.6.4) 

Jt k 

where 0). = 0(t k ). Equation (4.6.4) generates a sequence of estimates, i.e., 
0 O = 0(0), = 0(T S ), 0 2 = 0(2 T s ), . . • , 0 k = 0{kT s ) of 0*. 

If we now use (4.6.4) instead of (4.6.1) to estimate 0*, the error e and z 
have to be generated using 0 k instead of 0 (f), i.e., 

z(t) = 0fc<Mt), e(t) = ^ 2 ^)^ ’ e [tkitk+i] (4.6.5) 

We refer to (4.6.4), (4.6.5) as the hybrid adaptive law. The following 
theorem establishes its stability properties. 


Theorem 4.6.1 Let m,T s ,T be chosen so that 

(a) ^4 < 1 , m > 1 

(b) 2 — T s \ m > 7 for some 7 > 0 

where X m = A max (r). Then the hybrid adaptive law (4.6.4), (4-6.5) guaran- 
tees that 

(i) 0 k £ lo o- 

(ii) A0 k G l 2 ; e, em G Coo D £2, where A0 k = 0 fc+1 - 0 k . 

(iii) If m,(j) € £00 and <j) is PE, then 0 k — > 0* as k — > 00 exponentially fast. 

Proof As in the stability proof for the continuous adaptive laws, we evaluate the 
rate of change of the Lyapunov-like function 

v(k) = ejT~ 1 e k (4.6.6) 

along the trajectory generated by (4.6.4) and (4.6.5), where 9 k = 0 k — 0*. Notice 
that 


A V(k) = (2 0 k + A0 fe ) T r- 1 A0 fc 
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where A V(k) = V{k + 1) — V(k). Using (4.6.4) we obtain 


A V{k) = 2 9j 


f‘tk + 1 


e(r)0(r)^r - 


't k 


ctk+i 


e{r)(j){T)dr ) V 


’t k 


rtk+i 


e(r)(j)(T)dT 


'tk 


Because em 2 = —9j(j)(t) and X m > ||r||, we have 

rtk+i 


A V(k) < -2 


“(t)?ti (r)dr + A„ 


tk+1 | ( w I 

|e(r)m(r)|— — — dr I (4.6.7) 


'tfe 


0) 


Using the Schwartz inequality, we can establish that 


r +1 |e(r)m(r)|M^ < e 2 (r)m 2 (r)d 

Jt k m\T) j j tk 


m 


ftk+ 1 


I^COI 

to(t) 


< To 


rtk+i 


'tk 


2 (r)m 2 (r)dT 


dr 

(4.6.8) 


where the last inequality follows from assumption (a). Using (4.6.8) in (4.6.7), it 
follows that 


r z k+ 1 

A V(k) < -(2 - T s X m ) / e 2 (r )m 2 (r )d 

Jtk 


(4.6.9) 


Therefore, if 2 — T s X m > 7 for some 7 > 0, we have A V(k) < 0, which implies 
that V(k ) is a nonincreasing function and thus the boundedness of V(k),9k and 9k 
follows. From (4.6.9), one can easily see that 


rtk+t 


2 (t)7ji 2 (r)dr < 


U(0) - V{k+ 1) 
(2 — T s X m ) 


(4.6.10) 


We can establish that lim k^ooV(k + 1) exists and, from (4.6.10), that em € 
Loo D £- 2 - Because m > 1, it follows immediately that e G £00 f) £2- 
Similarly, we can obtain that 


r z k + 1 

A9j A9k < T s X 2 m / e 2 (r )m 2 (r )dr (4.6.11) 

Jtk 

by using the Schwartz inequality and condition (a) of the theorem. Therefore, 

00 nOO 

Y A9jA9 k < T s X 2 m / e 2 (r)?n 2 (r)dr < 00 

Jo 


k= 1 


which implies A 9k G h and, thus, completes the proof for (i) and (ii). 
The proof for (iii) is relatively involved and is given in Section 4.8. 


□ 
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For additional reading on hybrid adaptation the reader is referred to [60] 
where the term hybrid was first introduced and to [172, 173] where different 
hybrid algorithms are introduced and analyzed. 

The hybrid adaptive law (4.6.4) may be modified to guarantee that 
9k G 7Z n belongs to a certain convex subset S of 7 Z" by using the gradi- 
ent projection method. That is, if S is defined as 

s = {e s n n \e T e < m 0 2 } 

then the hybrid adaptive law (4.6.4) with projection becomes 

Hk+i 

9 k+ i = 6 k + T e(r)0(r)dr 

Jt k 

j 9 k+ 1 if 9 k+ i G S 

k+l = l ]fed Mo 

and 9q G S. As in the continuous-time case, it can be shown that the hybrid 
adaptive law with projection has the same properties as those of (4.6.4). In 
addition it guarantees that 9 k G S, V/c > 0. The details of this analysis are 
left as an exercise for the reader. 


4.7 Summary of Adaptive Laws 

In this section, we present tables with the adaptive laws developed in the 
previous sections together with their properties. 

4.8 Parameter Convergence Proofs 

In this section, we present the proofs of the theorems and corollaries of the previous 
sections that deal with parameter convergence. These proofs are useful for the 
reader who is interested in studying the behavior and convergence properties of the 
parameter estimates. They can be omitted by the reader whose interest is mainly on 
adaptive control where parameter convergence is not part of the control objective. 

4.8.1 Useful Lemmas 

The following two lemmas are used in the proofs of corollaries and theorems pre- 
sented in this sections. 
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Table 4.1 Adaptive law based on SPR-Lyapunov design approach 


Parametric model 

z = w(s)e* T if 

Parametric model 
rewritten 

z = W(s)L(s)9* T (j), (f = L~ l (s)if 

Estimation model 

z = W(s)L(s)e T <j) 

Normalized 

estimation error 

e = z — z — W{s)L(s)en 2 s 

Adaptive law 

-O- 

t-H 

II 

•QS 

Design variables 

L _1 (s) proper and stable; W(s)L(s) proper and 
SPR; m 2 = 1 + n 2 s and n s chosen so that Too 

(e. g., n 2 = acf> T <j> for some a > 0) 

Properties 

(i) e, 9 £ Too; (h) e, en s , 0 £ T 2 


Lemma 4.8.1 (Uniform Complete Observability (UCO) with Output In- 
jection). Assume that there exists constants v > 0,fc„ > 0 such that for all to > 0, 
K(t) £ lZ nxl satisfies the inequality 



\K(t)\ 2 cIt < 


ki/ 


(4.8.1) 


Vi > 0 and some constants ko,v > 0. Then (C,A), where C £ 7 Z nxl ,A £ TZ nxn , is 
a UCO pair if and only if ( C , A + KC T ) is a UCO pair. 


Proof We show that if there exist positive constants fii,fii > 0 such that the 
observability grammian N(to,to + v ) of the system (C, A) satisfies 

fill < N(t o, to + v) < fill (4.8.2) 

then the observability grammian Ni(t, 0: to + v) of (C, A + I\C T ) satisfies 

fill < Nfitofio + v) < fi'il (4.8.3) 

for some constant ffi , fi. 2 > 0. From the definition of the observability grammian 
matrix, (4.8.3) is equivalent to 



222 


CHAPTER 4. ON-LINE PARAMETER ESTIMATION 


Table 4.2 Gradient algorithms 


Parametric model 

* = r T </> 

Estimation model 

z = 8 T (j) 

Normalized 

z — z 

estimation error 

^ 2 
m z 

A. Based on instantaneous cost 

Adaptive law 

-o- 

t-H 

II 

•Qi 

Design variables 

m 2 = 1 + n 2 , n 2 = a(j) T (f>, a > 0, T = T t > 0 

Properties 

(i) e, en s , 8,8 e Too! (ii) e, en s , 8 G C 2 

B. Based on the integral cost 

Adaptive law 

8 = —T(R8 + Q) 

R=-0R+*g-, R( 0) = 0 

Q = -PQ-& Q(o) = o 

Design variables 

m 2 = l-\-ri 2 , n s chosen so that (p/m G Coo (e. 
g., n 2 = a(f> T (f>, a>0);/3>0, r = r T >0 

Properties 

(i) e, en s , 8, 8, R, Q e Too! (ii) e,en s ,8 «E C 2 ; 
(hi) lim t ^oo 8 = 0 


Pi\xi(to)\ 2 < / \C T (t)x\(t)\ 2 dt < /3 2 |cci (to) 1 2 

■'to 

where X\ is the state of the system 

Xi = (A + KC t )xi 
yi = C T x i 

which is obtained, using output injection, from the system 


( 4 . 8 . 4 ) 


( 4 . 8 . 5 ) 


Table 4.3 Least-squares algorithms 
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Parametric model 

z = 6* T (j) 

Estimation model 

z = 9 T cj) 

Normalized 

estimation error 

CM 

<bT 

i 

o 

II 

VJJ 

A. Pure least-squares 

Adaptive law 

9 = Pe<\> 

P = ~P^P, P( 0) = Po 

Design variables 

P 0 = Pq > 0; m 2 = 1 + n 2 n s chosen so that 
(f>/m € Coo (e.g., n 2 s = a(j> T (f),a > 0 or n 2 = 
(j) T P(j)) 

Properties 

(i) e, en s , 8, 9, P € C ^ ; (ii) e,en s ,9 e C 2 ; (iii) 
lim^oo 9(t) = 9 

B. Least-squares with covariance resetting 

Adaptive law 

9 = Pe<\> 

P = -P^P, P( t +) = P 0 = po/, 

where t r is the time for which A m in{P) < Pi 

Design variables 

po > pi > 0; m 2 = 1 + n 2 , n s chosen so that 
4>/m £ Coo (e.g., n 2 = acjp (j>, a > 0 ) 

Properties 

(i) e, en s , 9,9, P £ C oo\ (ii) e, en s , 9 £ C 2 

C. Least-squares with forgetting factor 

Adaptive law 

8 = Pecfr 

p = [ PP-P^P if\\P(t)\\<Ro 

[ 0 otherwise 

P( 0) = Po 

Design variables 

m 2 = 1 + n 2 , n 2 = a<f> T (f> or (j) T Pp, (3 > 0, Rq > 

0 scalars; Po = Po > 0? -Fo | < Po 

Properties 

(i) e, en s , 9,9, P E Poo! (ii) e, en s , 9 £ C 2 
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Table 4.4 Adaptive laws for the bilinear model 


Parametric model : z = W (s) p* (0 * T + zq) 

A. SPR-Lyapunov design: sign of p* known 

Parametric model 
rewritten 

z = W{s)L{s)p*{6* T ^ + Zl ) 

<t> = z\ = L~ 1 (s)z 0 

Estimation model 

z = W(s)L(s)p(6 T 4> + z\) 

Normalized 

estimation error 

e = z — z — W(s)L(s)en 2 

Adaptive law 

0 = Te(j)sgn(p*) 

P = 1 e£, £ = 6 T (t> + z l 

Design variables 

L~ l {s) proper and stable; VP(s)L(s) proper and 
SPR; m 2 = 1 + n^; n s chosen so that ^G Coo 

(e.g. n 2 = a((j) T <f>+ z 2 ), a > 0); T = T t > 0, 7 > 0 

Properties 

(i) e, 9, p G Too; (h) e, en s , 0,p£ C- 2 

B. Gradient algorithm: sign(p*) known 

Parametric model 

z = P *(0* T <j) + Zl) 

rewritten 

4> = W(s)ip, z\ = W(s)zo 

Estimation model 

z = p(0 T (j)+ Z\) 

Normalized 

z — z 

estimation error 

m 2 

Adaptive law 

6 = Tecj)sgn(p*) 

P = £ = 6~ T (t> + z 1 

Design variables 

m? = 1 + n 2 - n s chosen so that ^ G C 00 (e.g., 

n 2 = 4 > t 4> + z 2 ); r = r T > o, 7 > 0 

Properties 

(i) e, en s , 0 , p,0,p<E Coo ; (h) e, en s , 9,p G C 2 


x = Ax 
y = C T x 

Form (4.8.5) and (4.8.6), it follows that e = X\ — x satisfies 

e = Ae + KC T x\ 


(4.8.6) 
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Table 4.4 (Continued) 


C. Gradient algorithm with projection 

Sign ( p *) and lower bound 0 < po < p* known 

Parametric model 
rewritten 

z = 6* T cj) 

9* = [6t,ef] r ,e$ = p*, 9* = p*9* 

4> = [zi,4> t ] t 

Estimation model 

z = 6 t 4> 

Normalized 

estimation error 

z — z 

€ = 2 
m z 

Adaptive law 

( if $isgn(p*) > p 0 or 

0 i=< if #isgn(p*) =p 0 and - 7 iziesgn(p*) < 0 

(O otherwise 

9-2 = T2e4> 

p = di,o= s i 

Design variables 

m 2 = 1 + n 2 \ n s chosen so that ^ G Too (e.g., n 2 = 
ac[> T ([), a > 0 ); 71 > 0; 0i(O) satisfies 0i(O)| > po; 

r 2 = rj > 0,7 > 0 

Properties 

(i) e, en s , 9 , p, 9, p G (ii) e, en s , 9,p G £ 2 

D. Gradient algorithm without projection 

Unknown sign (p*) 

Parametric model 

z = p*(9* T cj) + z 1 ) 

Estimation model 

z = N(x)p(9 T 4> + z\ ) 

N(x) = x 2 cosx 

x = w + w = e 2 m 2 , rc( 0 ) = 0 

Normalized 

estimation error 

z — z 

m 2 

Adaptive law 

9 = N{x)Ye(j> 

p = N(x)je £ = 9 T (j) + zi 

Design variables 

m 2 = l + n 2 s \ n s chosen so that ^ G £ 00 ! (e.g., 

n 2 = 4> T (j) + z 2 ); 7 > 0 , r = r T >0 

Properties 

(i) e, en s , 9, p, 9, p,x,w G £ 00 ; (ii) e, en s , 6,pe £2 
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Table 4.5. Hybrid adaptive law 


Parametric model 

z = 0* T (f) 

Estimation model 

z = &l 4>,t e [tfc,t fc+ i) 

Normalized 

estimation error 

z — z 

m 2 

Adaptive law 

9 k+1 = e k +T J^ +1 e(r)0(r)dr, k = 0, 1,2, ... , 

Design variables 

Sampling period T s = t k+1 - t k > 0, t k = kT s ; 
m 2 = 1 + n 2 and n s chosen so that \4>\/m < 1 
(e.g., n 2 s = oaf 1 4>, a > 1 ) 

r = r T > o 

2 — T s \ max (T) > 7 for some constant 7 > 0 

Properties 

(i) 9 k G lo 0 ? 677-5 G C'oo 

(ii) \9 k+1 - 9 k \ G h ; 6,677-5 € C 2 


Consider the trajectories x(t) and X\(t) with the same initial conditions. We 


have 


e(t) = ( <&(tiT)K(r)C T {t)xi{t)cLt 
1 1 0 

where 4> is the state transition matrix of (4.8.6). Defining 


Xi = 


a / I<C T x 1 /\KC T x 1 \ if \C T x\ \ ± 0 


K/\K\ if|(7 T si|=0 

we obtain, using the Schwartz inequality, that 


(4.8.7) 


|C' T (t)e(t)| 2 < [ |C T (f)$(f, t)A'(t)C t (t);ti(t) dr 

Jt 0 

< f |C T (t)4>(t,r)ii(T)|“ |A'(t)| 2 4t f |C T (r)a;i(T) | 2 dr (4.8.8) 

Jtn Jtn 
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Using the triangular inequality (a + b ) 2 < 2a 2 + 2 b 2 and (4.8.8), we have 

fto+v fto+v fto+v 

I \C T {t)xi{t)\ 2 dt < 2 / \C T {t)x{t)\ 2 dt + 2 / \C T (t)e(t)\ 2 dt 

J to «/ to to 

/•to + " 

< 2 / |C T (£);r(£)| 2 cif 

Jto 

fto+v rt 9 ft „ 

+2 / |C T (£)$(£, t)zi(t)| |X(r)| 2 dr / |C T (r)a:i(r)p drdt 

J tn J to J tn 


< 2/?2 |a?i (^o) | 2 


/'to+i' /*t /»T 

+2 / |C' T (t)4>(t,r)5i(r)|" |/iT(r)| 2 dr / |C T (r)a;i(T) |" drdt 

J tn J tn J tn 


'to J 1 0 


where the last inequality is obtained using the UCO property of (C, A) and the 
condition that x(tg) = Xi(to)- Applying the B-G Lemma, we obtain 


| C T (t ) x i (t ) \ 2 dt < 2f3 2 1 x i (t o ) | : 2 e 


By interchanging the sequence of integration, we have 


{ fto^ ft 0 2 | C ' T ( t )$(^r)x i (r)| 2 |Ar(r)| 2 drdt| 


(4.8.9) 


C T (£)4>(£, r)a’i(r) 2 |iL(r)| 2 drd£ = 


rto+vrto+v 


C (£)$(£, t)x\(t) dt\K(r)\ 2 dT 


Because (C,A) being UCO and |aq| = 1 imply that 

fto+v „ fto+v 

/ |C T (£)<!>(£, r)*i(r)| dt < / | C T (£)$(£, r) |“ dt < /3 2 

J T J to 

for any f 0 < r < f 0 + it follows from (4.8.1) and the above two equations that 


fto+v ft. 

/ / |C' t (£) 4 >(£,t) 5 i(t)|“ |A"(r)| 2 drd£ < k u (3 2 (4.8.10) 

Jtn Jtn 


>t 0 Jt 0 


and, therefore, (4.8.9) leads to 


|C ,T (t)a:i(f)| 2 cft < 2f3 2 e 2l32k, '\x 1 (to)\'' ! 


(4.8.11) 


On the other hand, using x-\ = x+e and the triangular inequality ( a+b ) 2 > \a 2 — b 2 , 
we have 

fto + V 1 fto + V fto+V 

/ |C T (£)a;i(£)| 2 d£ > - \C T {t)x{t)\ 2 dt — / \C T (t)e(t)\ 2 dt 

Jto 2 Jt 0 Jt 0 

o fto+v 

> ^|*i(to)| 2 -/ \C T (t)e(t)\ 2 dt (4.8.12) 

Z Jtn 
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where the last inequality is obtained using the UCO property of (C. A) and the fact 
x(to) = x\(to). Substituting (4.8.8) for |C T e| 2 , we obtain 


r'a ; " _ a, 

\C T (t)x\(t)\ 2 dt > -r-|a;i(to)| 2 


'to 


rto+v 


rto+v ft 


\C r (t)$(t, r)a;i(r)| 2 | K (r)| 2 drdt 


'to 


\C T (t)xi(t)\ 2 dt 
by using the fact that 

ft rto+V 

/ \CA (t)x\(t)\ 2 dr < / |C T ( r )x\{r)\ 2 dr 

Jtn Jtn 


’ to 

for any t < to + v. Using (4.8.10), we have 

rto+v 
/to 


or 


ft o-\-v a ft oA-v 

/ \C T (t)xi(t)\ 2 dt>-^-\x 1 (t 0 )\ 2 -p 2 K | C T (t)xi(t)\‘ 

Jtn 1 Jt 0 

Pi i_. /. m 2 


dt 


rto+v 


| C {t)xi(t)\ z dt > 


M*o)|" 


(4.8.13) 


2(1 + faki!) 

Setting P 1 = 2 (i+0 2 k ) ’ P 2 = 2/32e 2 ^ 2fct ', we have shown that (4.8.4) holds for 

fl x , fJ 2 > 0 and therefore (C,A + KC T ) is UCO, and, hence, the if part of the 
lemma is proved. 

The proof for the only if part is exactly the same as the if part since (C, A) can 
be obtained from (C. A + KC T ) using output injection. □ 


Lemma 4.8.2 Let H(s) be a proper stable transfer function and y 
u £ Coo, then 


ft+T ft-\-T 

J y 2 (r)dr < k\ J u 2 {r)dr + h 


H(s)u. If 


for some constants k\, k 2 > 0 and any t > 0, T > 0. Furthermore, if H(s) is strictly 
proper, then k\ < a||iL(s)||^ 0 for some constant a > 0. 


Proof Let us define 


fs(r) 


/(t) if t £ S 
0 otherwise 


where S C 1Z is a subset of TZ that can be an open or closed interval. Then we can 
write 


u(t) = U[ 0it) (r) + u [t>t+ T ] (r ) , VO <t <t + T 
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Because H(s ) can be decomposed as H(s) = ho + Hi(s ) where ho is a constant and 
-ffi(s) is strictly proper, we can express y = H(s)u as 

y(r) = hou(T) + Hi(s) {u[ 0) t) + «[t,t+T]} = h 0 u(r) + y^r) + y 2 {r) , VO < r < t + T 
where y\ = H\(s)u\o+ j ,y 2 = H\{s)u\ t t+ T\- Therefore, using the inequality 


(a + b + c) 2 < 3 a 2 + 3b 2 + 3c 2 


we have 


pt-\-T pt-\-T rt-\-T rt-\-T 

J y 2 (r)dT < 3/ig J u 2 (r)dr + 3 J y\(r)dr + 3 J ?/|(r)dr (4.8.14) 
Using Lemma 3.3.1, Remark 3.3.2, and noting that y 2 {r) = 0 for all r < 1, we have 

rt+T rt+T A 

/ yU T )dT = / yl{r)dT = \\y 2 (t+T)\\l < ll-ffi(s)||^||w [t ,t+T]lll 
Jt Jo 

where 

/» oo rt-\-T 

\\ u [t,t+T]\\l = u [t,t+T]( T )d T = u 2 (r)dT 


Hence, 


y 2 2 {r)dT < ||Zfi(s) 


u 2 (r)dr 


(4.8.15) 


To evaluate the second term in (4.8.14), we write 


2/i (r) = /u(r) * U[ 0 ,t)(r) = / /n(r - o-)u[ 0)t )(<7)d<r 

Jo 


f 0 hi(r — cr)u(a)dcr if t <t 
f 0 h\(r — a)u(a)da if t >t 


where h \( t ) is the impulse response of i?i(s). Because -ffi(s) is a strictly proper 
stable transfer function, |/ii(r — cr)| < a 1 e~ ol2 P~ cr > for some constants a i,a 2 > 0, 
then using the Schwartz inequality and the boundedness of u(r), we have 

rt+T rt+T / rt \ 2 

J y 2 ( r)dT = J h\(r — a)u(a)daj dr 

< J (/ M r-a)\da J |/ii(t — a)\u 2 (a)da^j dr 

< af e-^-^dcr j e- a2iT - a) u 2 (a)da^ dr 


rt+T 


2(T - 4) dT 


(4.8.16) 
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where a = sup a u 2 (a). Using (4.8.15) and (4.8.16) in (4.8.14) and defining k\ = 
3 {hi + \\H 1 (s)\\l 0 ),k 2 = it follows that 


j y 2 {r)dT<ki j u 2 {r)dT + k 


and the first part of the Lemma is proved. 

If H(s ) is strictly proper, then h 0 = 0, 77i(s) = H(s) and k-\ 
therefore the proof is complete. 


□ 


Lemma 4.8.3 (Properties of PE Signals) If w : 1Z + i— > 7 Z n is PE and w £ Coe 

then the following results hold: 

(i) w\ = Fw, where F £ 7 Z mxn with m < n is a constant matrix, is PE if and 
only if F is of rank m. 

(ii) Let (a) e £ C -2 or (b) e € Coo and e — > 0 as t — > oo, then u >2 = w + e is PE. 

(iii) Let e £ S(n) and e £ Coo- There exists a p* > 0 such that for all / 1 £ [0,/i*), 

= a: + e is PE. 

(iv) If in addition w £ Coo and H(s) is a stable, minimum phase, proper rational 
transfer function, then W 3 = H(s)w is PE. 


Proof The proof of (i) and (iii) is quite trivial, and is left as an exercise for the 
reader. 

To prove (iv), we need to establish that the inequality 

pt-\-T 

foT> J ( q T w 3 (T)) 2 dr>p 1 T (4.8.17) 


holds for some constants (3\,(32,T > 0, any t > 0 and any q £ R n with \q\ = 1. 

The existence of an upper bound in (4.8.17) is implied by the assumptions that 
w £ Coo, H(s ) has stable poles and therefore uj 3 £ Coo- To establish the lower 
bound, we define 

A a r T 

z {t) = , . v ,. g w 

(s + a) r 

and write 

z(t) = 7 — - — r-g T H~ 1 (s)w 3 = - — — — ^H~ 1 (s)q r w 3 
y 1 ( s + a)' w (s + a) r w 

where a > 0 is arbitrary at this moment and r > 0 is an integer that is chosen to be 

equal to the relative degree of H(s) so that H~ 1 (s) is a proper stable transfer 

function. 

According to Lemma 4.8.2, we have 


j-t+T rt+T 

J z 2 (r)dT < k\ J ( q T w 3 (r )) dr + k 


■ t+T 
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for any t,T > 0 and some k\, k 2 > 0 which may depend on a, or equivalently 
pt+T 1 / rt+T \ 

J (<? T w 3 (r)) 2 dr > — ij z 2 (T)dT-k 2 j (4.8.18) 

On the other hand, we have 


z{t) = , , w g ^ 

(s + a) r 


= + 


a' - (s + a) r \ T . 

1 q w 


s(s + a) r 


= q W + Z\ 


where 


a r — (s + a) r T . 
zi = — , , w g w 


s(s + a) 

It is shown in Appendix A (see Lemma A. 2) that 


a r — (s + a) r 


s(s + a) T 


k 

< - 
a 


for some constant k > 0 that is independent of a. Therefore, applying Lemma 4.8.2, 
we have 


rt+T 


rt+T 


zf(r)dr < — (q w) 2 dr + k: 


a 
k 4 T 
~cP 


< ^ + fc 3 


where k 3 = 3fc 2 , k 4 = fc 3 sup t \u>\' 2 and A 3 > 0 is independent of a, and the second 
inequality is obtained using the boundedness of w. Using the inequality (x + y ) 2 > 


\x 2 — y 2 , we have 


/ t+i rt+i 

z 2 {r)dT = J {q T w(t) + Zi(t)) 2 

rt+T rt+l 

J (q T w) 2 dr — J z 2 (r)d7 


1 

> - 
“ 2 


' 2 dr 

rt+T 


Because w is PE, i.e. , 


1 

> - 
“ 2 


rt+T 0 

J ( q T w) 2 dT > a 0 T 0 


rt+ V«o 2 ^-^-fc 3 


(4.8.19) 
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for some To, ao > 0 and Vf > 0, we can divide the interval [t,t + T\ into subintervals 
of length Tq and write 



no pt+iTo 
(, q T w) 2 dr > Y / 

~[Jt+(i-OT 0 


( q T w) 2 d,T > n 3 a 3 T 0 


where no is the largest integer that satisfies % < jr. From the definition of no, we 
have no > ^ — 1; therefore, we can establish the following inequality 

J^ + (q T w) 2 dT>a 0 ^-?jT 0 = a 0 (T-T 0 ) (4.8.20) 

Because the above analysis holds for any T > 0, we assume that T > T 0 . Using 
(4.8.19) and (4.8.20) in (4.8.18), we have 

[ t+T (q T w 3 ) 2 dT > 1(^£(T — T 0 )-^T-fc 3 -fe 


ki 


1 lY «o k 4 


k\ 


T — 


^o-qTq 


+ ^3 + ^2) \ (4.8.21) 


Since (4.8.21) holds for any a,T > 0 and a 3 ,k 4 are independent of a, we can first 
choose a to satisfy 


Ot 0 k 4 Q!o 

~2 ~ ~cfl - T 


and then fix T so that 


o^T _ + k 3 + k2 ^ > /3 lkl T 


for a fixed pi > 0. It follows that 

rt+T 


j (q T w 3 (t)) 2 dr > Pi T > 0 

and the lower bound in (4.8.17) is established. 


□ 


Lemma 4.8.4 Consider the system 

Yi = A c Y 4 - B c 4> t Y 2 

Y 2 = 0 (4.8.22) 

yo = cjY 1 

where A c is a stable matrix, ( C c ,A c ) is observable, and <fi € Too- If <t>f defined as 

<j>f = Cj (si — Ay l B c (j) 
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satisfies 


i rt-t-i o 

a\I < — / 0/(t)0J(t)c?t < a 2 I, 'it > 0 (4.8.23) 

to Jt 

for some constants <* 1 , 0 : 2 , To > 0, then (4-8.22) is UCO. 

Proof The UCO of (4.8.22) follows if we establish that the observability grammian 
A f(t, t + T) of (4.8.22) defined as 

rt+T 


a r + 

Af(t, t + T) = J $ T (r,t)CC T §(T,f)<h 


where C = [CjO] 7 ^ satisfies 


pi > A f(t, t + T) > al 


for some constant a, P > 0, where 4>(t,t 0 ) is the state transition matrix of (4.8.22). 
The upper bound pi follows from the boundedness of <!>(/:, f 0 ) that is implied by 
c t> € Coo and the fact that A c is a stable matrix. The lower bound will follow if we 
establish the following inequality: 


rt+T 


2/o(r)dr>a(|Yi(i)| 2 + |y 2 | 2 ) 

where Y 2 is independent of t, due to y 2 = 0. From (4.8.22), we can write 

Vo(t) = Cj Y ± ( t) = Cj e+^Y^t) - £ Cj e A ^ T ~^ B c <j> T (a)daY 2 


A 


= 2/1 (t) + 2/2 (r) 

forallr > t, where yi(r)= Cj e A °0~ t '>Yi(t), 2/2(1") = — + Cje Ac< - T ~OB c p T (a)da ho- 
using the inequalities ( x + y ) 2 > ^ — y 2 and ( x + y ) 2 > % — x 2 with x = y \ , y = yi 
over the intervals [t,t + T ], [t + T , t + T\, respectively, we have 


f T V+)dr > / 


t+T „ 2 


2/rO) 


rt+T 


dr — 


yl+dr 


rt+T .,2 


2/1 ( T ) 


rt+T 


dr — 


It+T' 


I t+T' 


2/i (r) dr 


(4.8.24) 


for any 0 < T < T. We now evaluate each term on the right-hand side of (4.8.24). 
Because A c is a stable matrix, it follows that 

|2/i(r)| < k ie -^P-t)\ Yl (t)\ 
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for some fci, 7 i >0, and, therefore, 

rt+T 


I yi{ T )dT < -M-e -27lT |y 1 (t )| 2 
Jt+T' ^7l 

On the other hand, since (C c ,A c ) is observable, we have 
rt+T 

J e A ° {t ~ T) C c Cj e Ac(t ~ T) dT > k 2 I 

for any T > T) and some constants k 2 ,Ti > 0. Hence, 

rt+T 

J Vi{ T )dT > k 2 \Yi(t)\ 2 

Using y 2 {r) = — c/>J ( t)Y 2 and the fact that 


(4.8.25) 


(4.8.26) 


rt+T 

a 2 niT 0 I > / 4 >f(j)jdT > ri^aiTM 
Jt+T 1 


where no,ni is the largest and smallest integer respectively that satisfy 

T-T 

no < —= — < ni 

i.e. , no > d+A 1, n+ < ++ + 1, we can establish the following inequalities sat- 


isfied by y 2 : 


£ yl+)dT < a 2 T 0 | ^ + 1 j \Y 2 

rt+T 

/ yl{ T )dT > aiT 0 

Jt+T' 


T-T 


-1 |U 2 | 2 


(4.8.27) 


Using (4.8.25), (4.8.26), (4.8.27) in (4.8.24), we have 


rt+T 


yl{r)dT > 


fcg _ fcf -271 T 
2 2 7l 


\Yi(t+ 


aiT 0 T-T 


To 


- 1 ] - a 2 T 0 ( ^ + 1 ] ] |Y 2 | 
-to 


(4.8.28) 


Because the inequality (4.8.28) is satisfied for all T,T with T > Tf, let us first 
choose T such that T > T\ and 

_ k’l 271 t > ^2 

2 2 7 i “4 
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Now choose T to satisfy 

ai T 0 (t-T‘ 


Tn 


\ / T \ 

- 1 — o^To — + 1 > Pi 

\ Jn 


for a fixed Pi. We then have 

rt+T 


yl{r)dr > a (| hi(f)| 2 + |y 2 (i)| 5 
Jt 

where a = min {Pi , ^}. Hence, (4.8.22) is UCO. 


□ 


4.8.2 Proof of Corollary 4.3.1 

Consider equations (4.3.30), (4.3.35), i.e., 

e = A c e + B c (—0 T (f) — en 2 s ) 

0 = T<f>e (4.8.29) 

e=Cje 


that describe the stability properties of the adaptive law. In proving Theorem 4.3.1, 
we have also shown that the time derivative V of 


V = 


e 1 P r e d T T~ l 6 


where T = T t > 0 and P c = Pj > 0, satisfies 


V < —ve 


,/,2 


(4.8.30) 


for some constant v >0. Defining 


m = 


A c - B c Cj n 2 - B c 


C=[CJ 0]', P = 


rficj o 

and x = [e T ,0 T ] T , we rewrite (4.8.29) as 

x = A(t) x, e = C' T x 

and express the above Lyapunov-like function V and its derivative V as 


Pc o 

o r- 1 


V = x T Px 

V = 2x T PAx + x T Px 

= x T (PA + A T P + P)x < —v'x T CC T x = —v'e 2 
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where P = 0. It therefore follows that P, as defined above, satisfies the inequality 


A T (t)P + PA(t) + v'cC T < O 


Using Theorem 3.4.8, we can establish that the equilibrium e e = 0 ,9 e = 0 (i.e., 
x e = 0) of (4.8.29) is u.a.s, equivalently e.s., provided (C,A) is a UCO pair. 

According to Lemma 4.8.1, (C, A) and (C, A + KC T ) have the same UCO 
property, where 


K = 


B c n 2 s 


is bounded. We can therefore establish that (4.8.29) is UCO by showing that 
(C, A + KC t ) is a UCO pair. We write the system corresponding to (C, A + KC T ) 
as 

U = A c Y-, - B c c\> T Y 2 

Y 2 = 0 (4.8.31) 

Vo = CjY 1 


Because 4> is PE and Cj (si — A C )~ 1 B C is stable and minimum phase (which is 
implied by Cj (si — A c )~ l B c being SPR) and <\> £ £oo, it follows from Lemma 4.8.3 
(iii) that 

<t > f( T ) = j T Cje A ^ T -^B c <l>(a)da 
is also PE; therefore, there exist constants ai,a 2 ,T 0 > 0 such that 


1 /*t+7o 

a 2 I>— (j>f(T)(ff (r)dr > ail, Vt > 0 
Jt 

Hence, applying Lemma 4.8.4 to the system (4.8.31), we conclude that (C, A + 
A'C t ) is UCO which implies that (C,A) is UCO. Therefore, we conclude that the 
equilibrium 9 e = 0, e e = 0 of (4.8.29) is e.s. in the large. □ 


4.8.3 Proof of Theorem 4.3.2 (iii) 

The parameter error equation (4.3.53) may be written as 


0 = A(t)9 
y 0 = C T (t)9 


(4.8.32) 


where A(t) = — -,C T (t) = — y 0 = em. The system (4.8.32) is analyzed 
using the Lyapunov-like function 


2 
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that led to 


( 9 T d >) 2 2 2 

V = -± — = — e 2 m 2 


in * 


along the solution of (4.8.32). We need to establish that the equilibrium 9 e = 0 of 
(4.8.32) is e.s. We achieve that by using Theorem 3.4.8 as follows: Let P = T _1 , 


then V = H 2 P ^ and 


V= ^9 T [PA(t) + A T (t)P + P]9 = -d T C(t)C T (t)e 

where P = 0. This implies that 

P + PA(t) + A T P + 2 C(t)C T {t) < 0 

and according to Theorem 3.4.8, 9 e = 0 is e.s. provided (C, A) is UCO. Using 
Lemma 4.8.1, we have that (C, A) is UCO if (C, A + KC T ) is UCO for some K that 
satisfies the condition of Lemma 4.8.1. We choose 



m 


leading to A + KC T = 0. We consider the following system that corresponds to the 
pair (C, A + KC T ), i.e., 


Y = 0 

yo = C r Y=-^Y 


(4.8.33) 


The observability grammian of (4.8.33) is given by 


A f(t, t + T ) 



<t>(T)(f> T ( T ) 
to 2 (t) 


Because </> is PE and m > 1 is bounded, it follows immediately that the grammian 
matrix A f(t, t + T) is positive definite for some T > 0 and for allt > 0, which implies 
that (4.8.33) is UCO which in turn implies that (C, A) is UCO; thus, the proof is 
complete. □ 


In the following, we give an alternative proof of Theorem 4.3.2 (iii), which does 
not make use of the UCO property. 

From (4.3.54), we have 

V(t + T) = V(t ) - [ + e 2 m 2 dr = V(t) - [ + (4.8.34) 

Jt Jt m 
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for any t,T > 0. Expressing 6 T (t)4>{t) as 0 T (r)</>(r) = 0 T (f)^(r) + ( 9(t ) — 
9{t)) T (j){r) and using the inequality ( x + y ) 2 > bx 2 — y 2 , it follows that 


r t+T { F { r )^)) 2 ^ > 


r t+T 


> 


m 

J_ fl 
rn () 1 2 J t 

rt.+T 


mo J t 


(0 T (r)0(r)) 


dr 




(0 T (t)^(r)) 2 (ir 


t) ~ 0(f)) T </>(r) ) dr 


(4.8.35) 


where mo = sup t>0 m 2 (t) is a constant. Because </> is PE, i.e., 

rt+T 0 


/ t-t-io 

4>{t)(/> t ( r)dr > a 0 do-f 


for some To and «o > 0, we have 

r t + T o /.. % 2 


(d T dr > a 0 T 0 6 T (t)6(t) 


> 2a 0 T 0 \- 1 ax (T~ 1 )V(t) 
— 2ttoT()A m m (E)V(t) 


On the other hand, we can write 


0(t) — 9(t) = [ 9(a)da = — [ T- — ^ - cj)(a)da 

Jt Jt m z (a) 


and 


(d(r) - 0(t)) T <(-(r) = - 


rr / nT 


6» T (CT)^(cr)r^(cr) 


TO* 


0(r)do 


d T (CT)((»(cr) (/> T (T)r</>(tr) 


(4.8.36) 


der (4.8.37) 


dt to(ct) m(tr) 

Noting that m > 1, it follows from (4.8.37) and the Schwartz inequality that 

r t + T o / .. _ x 2 

(r) - 6»(t)) T ^(r) ) dr 


rt+To / r /iT 


< 


0 T (r)r^(cr) 


r r / /jt 


<i(T 


6> T (a)^(c7) 


/ t-M 0 /* 

(r-t) J 


m(cr) 

t+To rT ( &{?)*{?) 


m(a) 
\ 2 


dcr dr 


m 


O) 


derdr 
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where /? = sup T>0 |</>(r)|. Changing the sequence of integration, we have 


r t + T o / .. _ _ n 2 

(r) - 0{t)) T (j){T) ) dr 


< /? 4 ALx(r) j t 

< 


t+T 0 / flT 




i(ct) 


o 

/ (r — t)drda 

J (7 


t+T 0 / nT 


{9 (cr)^(cr)) 2 / Tq ~(<T-t) 


< 


m 2 {a) 

/3 4 A Lx(r)T 0 2 f t+To (fl~ T (q)<Xq)) 2 

2 J t m 2 (a ) 


der 


d<7 


(4.8.38) 


Using (4.8.36) and (4.8.38) in (4.8.35) with T = Tq we have 


r t+T ° (0 T (r)^(r)) 2 rfr > Q 0 r 0 A mm (r) F(f) 


2 (r) 


which implies 

/■ i+T » (0 T (r)0(r)) 2 


too 

34^2 \ 2 


P 4 T 2 \ 2 max (T) 
2 toq 


(■t+To 


m 2 (cr) 


d(J 


dr > 


CxCT}\min (r) 


U(t) 


J t m 2 (r) i i p^pAma^ 1 ; mo 

= 7 iU(t) (4.8.39) 

where 71 = 2 ino+J°r^^ r (r) • Using (4.8.39) in (4.8.34) with T = T 0 , it follows that 
V(t + T 0 ) < V(t) - 71 V (t) = yV(t) (4.8.40) 


where 7 = 1 — 71 . Because 71 > 0 and V(t + T 0 ) > 0, we have 0 < 7 < 1. Because 
(4.8.40) holds for all t> 0, we can take t = (n— 1)T 0 and use (4.8.40) successively 
to obtain 


V(t) < V{nT 0 ) < 7 n U( 0 ), Vt > nT 0 , n = 0, 1, . . . 


Hence, V(t) — > 0 as t — > 00 exponentially fast which implies that 9(t) — > 0 as t — > 00 
exponentially fast. □ 


4.8.4 Proof of Theorem 4.3.3 (iv) 

In proving Theorem 4.3.3 (i) to (iii), we have shown that (see equation (4.3.64)) 

V(t) = -e T (t)R(t)9(t) 
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where V = §Tr 2 1§ . 


From equation (4.3.60), we have 



e -/3 ( t-r) 


<Xp)</> t (t) 

2 7 t dT 

m z (r) 


Because </> is PE and m is bounded, we have 


m = 


> 

> 


’ t-T 0 


„- g(t - T )0 (T)0 T (r) 

m 2 (r) 


nt—T 0 


dr 


.-0(t-T) < t>( T )<t> T (T) 

m 2 (T) 


dr 


' C -PT 0 


a n e 


(/>(t)(/> t (r)fir 


It-Tn 


fhe- 0To I 


for any t > T 0 , where (3\ = ao a cTo> a o = su Pt m ht) anc ^ a oFo > 0 are constants 
given by (4.3.40) in the definition of PE. Therefore, 


V < -foe-^FO < —2f3i X m in (r ) e _/3T ° V 
for t > Xb> which implies that V(t) satisfies 

V(t) < e"“ (t - To V(T 0 ), t >Tq 

where a = 2/3ie~^ T °X m i n (T). Thus, V(t) — > 0 as t — > oo exponentially fast with a 
rate equal to a. Using ^2\ min (T)V < \9\ < \/2X max (T)V, we have that 


|0(*)l < \J 2X max (T ) V (T 0 )e~ 


(t-To) ^ FmaxiA") 

V X m in( r) 


\0(T o )\e~ 


(' t-T 0 ) 

5 


t >T 0 


Thus, 0(f) — > 9* exponentially fast with a rate of ^ as t — > oo. Furthermore, for 
r = 7 /, a = 2/3i'ye~ l3T ° and the rate of convergence (a/2) can be made large by 
increasing the value of the adaptive gain. □ 


4.8.5 Proof of Theorem 4.3.4 (iv) 


In proving Theorem 4.3.4 (i) to (iii), we have shown that 0(f) satisfies the following 
equation 


m = p^Po^m 


We now show that P(t) — > 0 as t — » 00 when <j> satisfies the PE assumption. 
Because P _1 satisfies 
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using the condition that <j> is PE, i.e., ff +T ° 0('r)0 T (r)dr > aoT 0 I for some constant 
ao , T 0 > 0, it follows that 




-P 1 (0) = [ > n 0 

Jo ™ 


ao?o 


/ > 


/ _t QqTq/ 

\T 0 ) m 


where to = sup t {m 2 (t)} and no is the largest integer that satisfies no < ijr, i.e., 
n 0 > ijr — 1. Therefore, 


P~\t) 


which, in turn, implies that 


> 


> 


P _1 (0) + 




®qTq j 
fh 


ooTqj 

fh 


Mt > T 0 


m< 




Mt > T 0 


(4.8.41) 


Because P(t) > 0 for all t > 0 and the right-hand side of (4.8.41) goes to zero as 
t — y oo, we can conclude that P(t) — > 0 as t — » oo. Hence, 9{t) = P(t)P o _1 0( 0) — > 0 
as t — > oo. □ 


4.8.6 Proof of Corollary 4.3.2 

Let us denote T = P _1 (t), then from (4.3.78) we have 

t = -pr + r(o) = r T (o) = r 0 = Pq - 1 

or 

r(t) = e~ 0t To + f e -/3(t-r) 

Jo 

Using the condition that <j>(t ) is PE and m € Too, we can show that for all t > Tq 

r (t) > [* 

Jo m 

- f 

Jt-To 

> e -? T °^I (4.8.42) 

m 

where fh = sup t m 2 (t). For t < Tq, we have 

r(t) > e-^r 0 > e-^To > A min (r 0 )e- /3To I 


„-0(t-r) 0( T ) < ft T ( T ) 


rt—Tn 


dr - 




dr 


(4.8.43) 
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Conditions (4.8.42), (4.8.43) imply that 

T(<) > 7i7 (4.8.44) 

for all t > 0 where 71 = minj^pi, A min (r 0 )}e _/3To . 

On the other hand, using the boundedness of <fi, we can establish that for some 
constant 0 2 > 0 

r(t) < r 0 + p 2 f e-^-^dri 

Jo 

02 

< A roax (r 0 )/+^/< 72 / (4.8.45) 

where 72 = A max (T 0 ) + ^ > 0. 

Combining (4.8.44) and (4.8.45), we conclude 

7i I < L(t) < 72 1 

for some 71 >0,72 >0. Therefore, 

M 1][ < p (t) < li 11 

and consequently P(f),P _1 (f) € £oo- Because P(t),P _1 (f) G C 00, the exponential 
convergence of d to 0* can be proved using exactly the same procedure and argu- 
ments as in the proof of Theorem 4.3.2. □ 


4.8.7 Proof of Theorem 4.5. 1 (iii) 

Consider the following differential equations which describe the behavior of the 
adaptive law (see (4.5.6) and (4.5.7)): 

e = A c e + B c (— p*9 T — en 2 ) 

9 = Te(j)Sgn(p*) (4.8.46) 

P = 7e£ 
e = Cje 

Because £ C 0 0 and £ G C 2 , we can treat as external input functions and 
write (4.8.46) as 

Xa = A a X a + Pa(-pC) (4.8.47) 

where 



A [ A c - n 2 B c Cj —p* B c (f> T 
a ~ [ r S gn(p*)0C'J 0 
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In proving Corollary 4.3.1, we have shown that when <j> is PE and <f>, (j> £ Coo, the 
system x = A a x is e.s. Therefore, the state transition matrix 4> a (f,fo) of (4.8.47) 
satisfies 

||*aM)|| <« 0 e- 7ot (4.8.48) 

for some constants ao, jo > 0, which together with ~B a p^ £ C 2 imply that x a (t) — > 
0 as t — > 00 . 

4.8.8 Proof of Theorem 4.6.1 (iii) 

From the proof of Theorem 4.6.1 (i) to (ii), we have the inequality (see (4.6.9)) 


rtk+i 

V{k + 1) - V(k) < -(2 - T s X m ) / e 2 (r)?Ti 2 (r)ci 

Jtk 


(4.8.49) 


Using inequality (4.8.49) consecutively, we have 


rtk+n 


V(k + n) — V(k) < — (2 — T s X m ) / e 2 (r)?n 2 (r)(ir 

Jt k 

rtk+i+i 

= -(2 - T s A ro )y: / e 2 (r)m 2 (r)dT (4.8.50) 
*= 0 •'**+< 


for any integer n. We now write 


Pk+i+1 ftk+i+l (0j +i <j>(T) 

^2 ^ — 2 { ^\ _ / V / _d T 


' tk+i 


r)m 2 (r)dr = 




m- 


! (r) 


r t fc+i+ i (oJ<(>(t) + {6 k +i ~ &k) T 4 >{t )^ 


' t k-\-i 


\r) 


dr 


Using the inequality (x + y) 2 > \x 2 — j/ 2 , we write 


rtk+i+l 


e 2 (r)m 2 (r)dr > - 


2 / - - T '2 


X ftk+i+i (6^ </>(r) J /•tfc+i+i((0 fc+ j - 0 k ) <j>(r) 

dr — / — — dr 


' tk + i 


2 •/**+, ™ 2 


(r) 


' tk+i 


Because </>(t)/to(t) is bounded, we denote c = sup and have 

2 


2 ( t ) 


(4.8.51) 


tfc+i+l ( (#fc+i — $fc) <P(t 


Jt k+i ™ 2 ( T ) 

From the hybrid adaptive algorithm, we have 


-dr < cT s \9 k +i - 9k\ 2 


9k+i-0 k = / e(r)0(r)dr, i=l,2, ...,n 

Jt,, 
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therefore, using the Schwartz inequality and the boundedness of \<j>(t)\/m(t), 


1 9k+i — 0k\ 2 < 


tk+i \ 2 / rt 

|e(r)||^(r)|dT ) = 


tk 

tk + i 


e ( r )l TO ( r )^7^r dT 


< 


f e 2 (T)m 2 (T)dT f 
J ” t} c 


tk 

tk+ ' \Hr)\ 2 


i(r) 


m 2 (r) 


dr 


< ciT s 
ciT„ 


rtk+i 


lt k 

rtk+n 


'tfc 


2 (T)m 2 {r)di 


2 (r)m 2 (T)dT 


(4.8.52) 


Using the expression (4.8.52) in (4.8.51), we have 


ptk + i+1 1 rtk+i+1 

/ e 2 (r)m 2 (r)dr > - / 




f^fc + n 


' £/c + i 


2 W 


dr — c 2 iT 2 / e 2 (r)TO 2 (r)dr 


>tk 


which leads to 

ftk+n 


>t k 


e 2 (r)m 2 (r)dr = ^ 


” 3 /*tfc + i + l 


2 (r)m 2 (r)dT 


i=0 





rtfe+n 


2 (r) 


-dr — c 2 iT 2 / e 2 (r)m 2 (r)(ir 


'^fc 


- £ at; [“*' e *(T)m*(T)dT 
( T ) Ak 


0 fe T 


or equivalently 

rtk+n. 


r “ + " ' ,(T) ' ,T(T) -- n( " - 11 '“ + " «’(r)m J (r )dr 


't k 2m 2 (t) 


-drd k - 


i = o 
-c 2 T; 


'ifc 


e 2 (r)m 2 (r)dr > 


9 t 


; *fc 


f t ' c+ " </>(t)</> T (t) 
?n 2 (r) 


dr0 fc (4.8.53) 


'tk 


2(1 + n(n — l)c 2 T 2 /2) 

Because 0 is PE and 1 < m < oo, there exist constants oq, a 2 and To > 0 such that 

r t+To ^(r)</> T (r) 


a 2 I > 


-dr > a x I 


for any t. Therefore, for any integer k,n where n satisfies nT s > To, we have 

f tk+n 0W^ T (r) J 2 ^ iprrpi ^ V(k) i 

/ 2 dT ° k - a Nk Ok > -T 

/ tk m * m 




(4.8.54) 
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Using (4.8.53), (4.8.54) in (4.8.50), we obtain the following inequality: 


V(k + n) — V(k) < 


(2 - TsXm)ttl 

A m (2 + n(n — 1 )c 2 T s 2 ) 


V(k) 


(4.8.55) 


hold for any integer n with n > Tq/T s . Condition (4.8.55) is equivalent to 


V(k + n) < 'yV ( k ) 


with 

A (2 - T s \ m )oi 1 ^ 

7 ~ A m (2 + n(n — l)c 2 T s 2 ) 7 

Therefore, 

V ( kn ) = V (( k—)n + n) < jV ((fc — l)n) < -y 2 V ((k — 2 )n) < ... < q fc U (0) 


or 


\0ku\ < 




(4.8.56) 


Because 0 < 7 < 1 and, therefore, < 1, (4.8.56) implies that \6kn\ — > 0 expo- 
nentially fast as t — * 00 , which, together with the property of the hybrid adaptive 
algorithm (i.e., \0k+i\ < | 0 fe|), implies that 9k converges to 9* exponentially fast and 
the proof is complete. 


4.9 Problems 

4.1 Consider the differential equation 

9 = — 7 u 2 9 

given by (4.2.7) where 7 > 0. Show that a necessary and sufficient condition 
for 9{t) to converge to zero exponentially fast is that u(t) satisfies (4.2.11), 
i.e., 

/‘t+T’o 

J u 2 (t)cIt > aoT 0 

for all t > 0 and some constants ao,To > 0- (Hint: Show that 

e -7/ t > a (r)« I r < (t _ tl) 


for all t > t\ and some cc, 70 >0 if and only if (4.2.11) is satisfied.) 
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4.2 Consider the second-order stable system 



an 

0 12 

x + 

' b 1 ' 

X = 

a 2 i 

0 

^2 



where x, u are available for measurement, u € Coo and an, ai2, 021,61, 62 are 
unknown parameters. Design an on-line estimator to estimate the unknown 
parameters. Simulate your scheme using an = —0.25, ai2 = 3, 021 = —5, 61 = 
1,6 2 = 2.2 and u = I0sin2t. Repeat the simulation when u = 10sin2t + 
7cos3.6t. Comment on your results. 

4.3 Consider the nonlinear system 

x = ai/i(x) + 02/2(2:) + bigi(x)u + b 2 g 2 {x)u 

where u, x £ 1Z 1 , fi,gi are known nonlinear functions of x and aj,6; are 
unknown constant parameters. The system is such that u £ Coo implies 
x £ Coo ■ If x,u can be measured at each time, design an estimation scheme 
for estimating the unknown parameters on-line. 

4.4 Design and analyze an on-line estimation scheme for estimating 9* in (4.3.26) 

when L(s ) is chosen so that W(s)L(s) is biproper and SPR. 

4.5 Design an on-line estimation scheme to estimate the coefficients of the numer- 

ator polynomial 


Z(s) — b n ^is n 1 + b n — 2 s n 2 + • • • + bis + b 0 


of the plant 


V = 


R(s) 


u 


when the coefficients of R(s ) = s n + a„_is" 1 + • • • + ais + ao are known. 
Repeat the same problem when Z(s) is known and R(s) is unknown. 


4.6 Consider the cost function given by (4.3.51), i.e. 


J{9) 


(z — 9 T <j)) 2 
2m 2 


which we like to minimize w.r.t. 9. Derive the nonrecursive algorithm 


m = 



provided (f 1 <j> ^ 0 by solving the equation VJ(0) = 0. 

4.7 Show that u>o = Fu>, where F £ TZ mxn with m < n is a constant matrix and 
u £ Coo is PE, is PE if and only if F is of full rank. 
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4.8 Show that if lo,lo € u> is PE and either 

(a) e G C '2 or 

(b) e € £oo and e(t) — > 0 as f — > oo 
is satisfied, then w 0 = w + e is PE. 

4.9 Consider the mass-spring-damper system shown in Figure 4.11. 



Figure 4.11 The mass-spring-damper system for Problem 4.9. 

where [3 is the damping coefficient, k is the spring constant, u is the external 
force, and y(t) is the displacement of the mass to resulting from the force u. 

(a) Verify that the equations of the motion that describe the dynamic be- 

havior of the system under small displacements are 

my + j3y + ky = u 

(b) Design a gradient algorithm to estimate the constants to, /?, k when y, u 
can be measured at each time t. 

(c) Repeat (b) for a least squares algorithm. 

(d) Simulate your algorithms in (b) and (c) on a digital computer by assum- 
ing that m = 20 kg, /3 = 0.1 kg/sec, k = 5 kg/sec and inputs u of your 
choice. 

(e) Repeat (d) when to = 20 kg for 0 < t < 20 sec and to = 20(2 — 

e — o.oi(t— 2 o)) kg f or ^ > 20sec. 

4.10 Consider the mass-spring-damper system shown in Figure 4.12. 




Figure 4.12 The mass-spring-damper system for Problem 4.10. 
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(a) Verify that the equations of motion are given by 

k(yi -yi ) = u 

Kyi - y 2 ) = rny 2 + (3y 2 

(b) If Vi 1 2/2 , u can be measured at each time t , design an on-line parameter 

estimator to estimate the constants k, m and (3. 

(c) We have the a priori knowledge that 0 < /? < 1, k > 0.1 and m > 

10. Modify your estimator in (b) to take advantage of this a priori 
knowledge. 

(d) Simulate your algorithm in (b) and (c) when j3 = 0.2 kg/sec, m = 15 kg, 

k = 2 kg/sec 2 and u = 5 sin 2t + 10.5 kg • m/sec 2 . 

4.11 Consider the block diagram of a steer-by-wire system of an automobile shown 
in Figure 4.13. 



Figure 4.13 Block diagram of a steer- by- wire system for Problem 4.11. 


where r is the steering command in degrees, 9 P is the pinion angle in degrees 
and 9 is the yaw rate in degree/sec. The transfer functions G 0 (s),Gi(s) are 
of the form 

r = ^0 

0 s 2 + 2£oWos + ujg (1 - k 0 ) 

r m - fc i^i 

1 s 2 + 2^iWiS + io\ 

where ko, woj C 01 k±, u>i, ^1 are functions of the speed of the vehicle. Assuming 
that r,9 p ,9 can be measured at each time t, do the following: 

(a) Design an on-line parameter estimator to estimate w», * = 0,1 using 

the measurement of 9 p ,9,r. 

(b) Consider the values of the parameters shown in Table 4.6 at different 
speeds: 


Table 4.6 Parameter values for the SBW system 


Speed V 

k 0 

UJ 0 

Co 

ki 

U)1 

Cl 

30 mph 

0.81 

19.75 

0.31 

0.064 

14.0 

0.365 

60 mph 

0.77 

19.0 

0.27 

0.09 

13.5 

0.505 
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Assume that between speeds the parameters vary linearly. Use these 
values to simulate and test your algorithm in (a) when 

(i) r = 10 sin 0.2t + 8 degrees and V = 20 mph. 

(ii) r = 5 degrees and the vehicle speeds up from V = 30 mph to 
V = 60 mph in 40 second with constant acceleration and remains 
at 60 mph for 10 second. 

4.12 Show that the hybrid adaptive law with projection presented in Section 4.6 
guarantees the same properties as the hybrid adaptive law without projection. 

4.13 Consider the equation of the motion of the mass-spring-damper system given 
in Problem 4.9, i.e., 

my + (3y + ky = u 

This system may be written in the form: 

y = p*(u- my- /3y) 

where P* = \ appears in a bilinear form with the other unknown parameters 
m, (3. Use the adaptive law based on the bilinear parametric model to estimate 
p*,m,/3 when u, y are the only signals available for measurement. Because 
k > 0, the sign of p* may be assumed known. Simulate your adaptive law 
using the numerical values given in (d) and (e) of Problem 4.9. 



Chapter 5 


Parameter Identifiers and 
Adaptive Observers 

5.1 Introduction 

In Chapter 4, we developed a wide class of on-line parameter estimation 
schemes for estimating the unknown parameter vector 6* that appears in 
certain general linear and bilinear parametric models. As shown in Chapter 
2, these models are parameterizations of LTI plants, as well as of some special 
classes of nonlinear plants. 

In this chapter we use the results of Chapter 4 to design parameter iden- 
tifiers and adaptive observers for stable LTI plants. We define parameter 
identifiers as the on-line estimation schemes that guarantee convergence of 
the estimated parameters to the unknown parameter values. The design of 
such schemes includes the selection of the plant input so that a certain signal 
vector fi, which contains the I/O measurements, is PE. As shown in Chapter 
4, the PE property of fi guarantees convergence of the estimated parame- 
ters to the unknown parameter values. A significant part of Section 5.2 is 
devoted to the characterization of the class of plant inputs that guarantee 
the PE property of fi. The rest of Section 5.2 is devoted to the design of 
parameter identifiers for plants with full and partial state measurements. 

In Section 5.3 we consider the design of schemes that simultaneously 
estimate the plant state variables and parameters by processing the plant 
I/O measurements on-line. We refer to such schemes as adaptive observers. 
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The design of an adaptive observer is based on the combination of a state 
observer that could be used to estimate the state variables of a particular 
plant state-space representation with an on-line estimation scheme. The 
choice of the plant state-space representation is crucial for the design and 
stability analysis of the adaptive observer. We present several different plant 
state-space representations that we then use to design and analyze stable 
adaptive observers in Sections 5.3 to 5.5. In Section 5.6, we present all the 
lengthy proofs dealing with parameter convergence. 

The stability properties of the adaptive observers developed are based on 
the assumption that the plant is stable and the plant input is bounded. This 
assumption is relaxed in Chapter 7, where adaptive observers are combined 
with control laws to form adaptive control schemes for unknown and possibly 
unstable plants. 

5.2 Parameter Identifiers 

Consider the LTI plant represented by the vector differential equation 

y = C T x 

where x € IZ n ,u £ 7Z q ,y £ 7Z m and n > m. The elements of the plant 
input u are piecewise continuous, uniformly bounded functions of time. The 
plant parameters A £ 7 Z nxn , B £ 7Z nxq , C £ 7 Z nxm are unknown constant 
matrices, and A is a stable matrix. We assume that the plant (5.2.1) is 
completely controllable and completely observable i.e. , (5.2.1) is a minimal 
state-space representation of the plant. 

The objective of this section is to design on-line parameter estimation 
schemes that guarantee convergence of the estimated plant parameters to the 
true ones. We refer to this class of schemes as parameter identifiers. The 
plant parameters to be estimated are the constant matrices A, B, C in (5.2.1) 
or any other set of parameters in an equivalent plant parameterization. The 
design of a parameter identifier consists of two steps. 

In the first step we design an adaptive law for estimating the parameters 
of a convenient parameterization of the plant (5.2.1) by following any one of 
the procedures developed in Chapter 4. 
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In the second step, we select the input u so that the adaptive law designed 
in the first step guarantees that the estimated parameters converge to the 
true ones. 

As shown in Chapter 4, parameter convergence requires additional con- 
ditions on a signal vector (j> that are independent of the type of the adaptive 
law employed. In general the signal vector 0 is related to the plant input or 
external input command u through the equation 

4> = H(s)u (5.2.2) 

where H(s) is some proper transfer matrix with stable poles. The objective 
of the second step is to choose the input u so that 4> is PP, which, as shown 
in Chapter 4, guarantees convergence of the estimated parameters to their 
true values. 

Before we embark on the design of parameter identifiers, let us first 
characterize the class of input signals u that guarantee the PE property of 
the signal vector (j> in (5.2.2). The relationship between 0, H(s) and the 
plant equation (5.2.1) will be explained in the sections to follow. 

5.2.1 Sufficiently Rich Signals 

Let us consider the first order plant 


y = -ay + bu , y( 0 ) = y 0 


or 

y = — 7 — u (5.2.3) 

s + a 

where a > 0 , b are unknown constants and y, u £ Too are measured at each 
time t. 

We would like to estimate a, b by properly processing the input/output 
data y. u. It is clear that for the estimation of a, b to be possible, the 
input/output data should contain sufficient information about a, b. 

For example, for u = 0 the output 

y (t) = e~ at y 0 

carries no information about the parameter b. If, in addition, ijq = 0 then 
y{t) = 0 Vi > 0 obviously carries no information about any one of the plant 
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parameters. If we now choose u = constant cq ^ 0, then 

y(t) = e~ at (yo - -c 0 ) + -Co 
\ a ) a 

carries sufficient information about a,b provided yo 7 ^ aco/b. This informa- 
tion disappears exponentially fast, and at steady state 

b 

V ~ -co 
a 

carries information about the zero frequency gain | of the plant only, which 
is not sufficient to determine a, b uniquely. 

Let us now choose u(t) = sin wot for some wo > 0. The steady-state 
response of the plant (5.2.3) is given by 

y ~ Asm(co 0 t + <p) 


where 

A = -. ^ r = . 161 f = (sgn (b) - 1)90° - tan ” 1 — (5.2.4) 

|jw 0 + a | 2 + a 2 a 

It is clear that measurements of the amplitude A and phase <p uniquely 
determine a, b by solving (5.2.4) for the unknown a, b. 

The above example demonstrates that for the on-line estimation of a, b 
to be possible, the input signal has to be chosen so that y{t) carries suffi- 
cient information about a, b. This conclusion is obviously independent of the 
method or scheme used to estimate a, b. 

Let us now consider an on-line estimation scheme for the first-order plant 
(5.2.3). For simplicity we assume that y is measured and write (5.2.3) in the 
familiar form of the linear parametric model 

z = r T </> 

where z = y,0* = [b, a] T , 0 = [u, — y\ T . 

Following the results of Chapter 4, we consider the following gradient 
algorithm for estimating 6* given in Table 4.2. 

0 = T e( j) : r = r T > 0 

e = z — z, z = 0 T cj) 


(5.2.5) 
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where 9{t) is the estimate of 9* at time t and the normalizing signal m 2 = 1 
due to 4> G Coc- 

As we established in Chapter 4, (5.2.5) guarantees that e, 9 E /^DAxm 
9 6 /loo, and if u e Too then e(t), 0(f) — > 0 as t — > oo. If in addition f is Pi?, 
i.e., it satisfies 


1 rt+To 

cni I > =- / </>(t)</> T ( r)dr > a 0 /, Vf > 0 

Jo Jt 


(5.2.6) 


for some To,ao,ai > 0, then 0(f) — > 9* exponentially fast, 
vector cj) is given by 


1 

b 

s+a 


U 


Because the 


the convergence of 9(t) to 9* is guaranteed if we choose u so that (j) is PE , 
i.e., it satisfies (5.2.6). Let us try the choices of input u considered earlier. 
For u = cq the vector (f> at steady state is given by 


L “ J 

which does not satisfy the right-hand-side inequality in (5.2.6) for any con- 
stant Co- Hence, for u = cq, 4> cannot be PE and, therefore, 0(t) cannot be 
guaranteed to converge to 9* exponentially fast. This is not surprising since 
as we showed earlier, for u = co, y(t) does not carry sufficient information 
about a, b at steady state. 

On the other hand, for u = sinf we have that at steady state 


sinf 

— Hsin(f + ip) 


where A = — ^== , (p = (sgn (b) — 1)90° — tan -1 ^ , which can be shown to 
satisfy (5.2.6). Therefore, for u = sint the signal vector cj) carries sufficient 
information about a and b, cj) is PE and 9{t ) — > 9* exponentially fast. 

We say that u = sin t is sufficiently rich for identifying the plant (5.2.3), 
i.e., it contains a sufficient number of frequencies to excite all the modes of 
the plant. Because u = cq / 0 can excite only the zero frequency gain of the 
plant, it is not sufficiently rich for the plant (5.2.3). 
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Let us consider the second order plant 


V = 


b\s + b 0 
+ aqs + oq 


u = G(s)u 


(5.2.7) 


where a±,ao > 0 and G(s ) has no zero-pole cancellations. We can show 
that for u = sincuoL y{t ) at steady state does not carry sufficient infor- 
mation to be able to uniquely determine ai, ao, £>i, &o- O n the other hand, 
u(t) = sin wot + sinaqf where uiq / uj\ leads to the steady-state response 


y(t) = A 0 sin(w 0 t + To) + M sin((Jit + ipi) 


where A 0 =| G(juj 0 ) \,<po = LG(juJo),Ai =| G(jv i) \,ti = IG(jcJi). By 
measuring Aq, A\, to, Ti we can determine uniquely ai,ao,6i,6o by solving 
four algebraic equations. 

Because each frequency in u contributes two equations, we can argue that 
the number of frequencies that u should contain, in general, is proportional 
to the number of unknown plant parameters to be estimated. 

We are now in a position to give the following definition of sufficiently 
rich signals. 


Definition 5.2.1 A signal u : 1Z + — > IZ is called sufficiently rich of order 
n if it consists of at least | distinct frequencies. 

For example, the input 

m 

u = ^2 Ai sincujf (5.2.8) 

i= 1 

where m > n/2, Ai / 0 are constants and uii / uok for i / k is sufficiently 
rich of order n. 

A more general definition of sufficient richness that includes signals that 
are not necessarily equal to a sum of sinusoids is presented in [201] and is 
given below. 

Definition 5.2.2 A signal u : n + -»• iz n is said to be stationary if the 
following limit exists uniformly in to 

l fto+T 

R u (t) = lim — / u(t)u (t + T)dr 
T^oo I J to 
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The matrix R u (t) G pi^xn j g ca n ec j the autocovariance of u. R u {t) is a 
positive semidefinite matrix and its Fourier transform given by 

/ OO 

e-i“ T R u {r)dT 

-OO 

is referred to as the spectral measure of u. If u has a sinusoidal component at 
frequency ujq then u is said to have a spectral line at frequency ujq and S u {uj) 
has a point mass (a delta function) at loq and — loq. Given S u (u), R u {t) can 
be calculated using the inverse Fourier transform, i.e., 

1 r°° 

Ru(t) = ^ / e^Sui^du 

J — oo 

Furthermore, we have 

/ OO 

S u (io)du> = 2irR u (0) 

-oo 

For further details about the properties of R u {t), S u (lo), the reader is referred 
to [186, 201]. 

Definition 5.2.3 A stationary signal u : IZ + — » IZ is called sufficiently 
rich of order n, if the support of the spectral measure S u (u) of u contains 
at least n points. 

Definition 5.2.3 covers a wider class of signals that includes those specified 
by Definition 5.2.1. For example, the input (5.2.8) has a spectral measure 
with 2 m points of support, i.e., at uii , —a for i = 1, 2, . . . m, where m > n/2, 
and is, therefore, sufficiently rich of order n. 

Let us now consider the equation 

4> = H(s)u (5.2.9) 

where H(s) is a proper transfer matrix with stable poles and G IZ n . The 
PE property of <f is related to the sufficient richness of u by the following 
theorem given in [201]. 

Theorem 5.2.1 Let u : IZ + e- > IZ be stationary and assume that H(juji), 

. . H(jio n ) are linearly independent on C n for all cji,a;2, . . . ,m n G IZ, where 
uii / for i / k. Then <ft is PE if, and only if, u is sufficiently rich of 
order n. 
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The proof of Theorem 5.2.1 is given in Section 5.6. 

The notion of persistence of excitation of the vector (j> and richness of the 
input u attracted the interest of several researchers in the 1960s and 1970s 
who gave various interpretations to the properties of PE and sufficiently 
rich signals. The reader is referred to [1, 171, 201, 209, 242] for further 
information on the subject. 

Roughly speaking, if u has at least one distinct frequency component for 
each two unknown parameters, then it is sufficiently rich. For example, if 
the number of unknown parameters is n, then rn > ^ distinct frequencies 
in u are sufficient for u to qualify as being sufficiently rich of order n. Of 
course, these statements are valid provided H(j uq), . . . , H(juj n ) with cu* / LOk 
are linearly independent on C n for all Wj £ 77 , i = 1,2, ... ,n. The vectors 
H(ju>i), i = 1,2 , ... ,n may become linearly dependent at some frequencies 
in 77 under certain conditions such as the one illustrated by the following 
example where zeros of the plant are part of the internal model of u. 


Example 5.2.1 Let us consider the following plant: 


V = 


bo(s 2 + 1) 
(s + 2)3 


u = G(s)u 


where bo is the only unknown parameter. Following the procedure of Chapter 2, we 
rewrite the plant in the form of the linear parametric model 


y = e*<p 


where 9* = bo is the unknown parameter and 

According to Theorem 5.2.1, we first need to check the linear independence of 
H{ju> i), . . . ,H{j(jj n ). For n = 1 this condition becomes H{ju) ^ 0,Vw £ 77. It is 
clear that for ui = 1, H (j) = 0, and, therefore, 4> may not be PE if we simply choose 
u to be sufficiently rich of order 1. That is, for u = sint, the steady-state values of 
<f>, y are equal to zero and, therefore, carry no information about the unknown bo- 
We should note, however, that for u = sinwt and any u / 1, 0, 0 is PE. V 


Remark 5.2.1 The above example demonstrates that the condition for the 
linear independence of the vectors i = 1, 2, ■ ■ ■ , n on C n is suf- 

ficient to guarantee that 4> is PE when u is sufficiently rich of order n. 
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It also demonstrates that when the plant is partially known, the input 
u does not have to be sufficiently rich of order n where n is the order of 
the plant. In this case, the condition on u can be relaxed, depending 
on the number of the unknown parameters. For further details on the 
problem of prior information and persistent of excitation, the reader 
is referred to [35]. 

In the following sections we use Theorem 5.2.1 to design the input signal 
u for a wide class of parameter estimators developed in Chapter 4. 

5.2.2 Parameter Identifiers with Full- State Measurements 

Let us consider the plant 

x = Ax + Bu, x( 0 ) = .To (5.2.10) 

where C T = I , i.e., the state x E TZ n is available for measurement and A, B 
are constant matrices with unknown elements that we like to identify. We 
assume that A is stable and u E Coo. 

As shown in Chapter 4, the following two types of parameter estimators 
may be used to estimate A, B from the measurements of x, u. 

Series-Parallel 

x = A m x + (A — A m )x + Bu (5.2.11) 

A = 7 ieix T , B = 7 2 ei u t 

where A m is a stable matrix chosen by the designer, ei = x — x, 71,72 > 0 
are the scalar adaptive gains. 

Parallel 

'x = Ax + Bu (5.2.12) 

A = 7 1 e 1 f T , B = 7 2 e! u t 

where e± = x — x and 71,72 > 0 are the scalar adaptive gains. 

As shown in Chapter 4, if A is a stable matrix and u E Coo then 
x ,A, B E Coo'-, ||A(t)yB(i)||,ei E £2 C Coo and ei (t) and the elements 
of A(t), B(t) converge to zero as t — > 00 . 
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For the estimators (5.2.11) and (5.2.12) to become parameter identifiers, 
the input signal u has to be chosen so that A(t),B(t) converge to the un- 
known plant parameters A, B, respectively, as t — > oo. 

For simplicity let us first consider the case where u is a scalar input, i.e. , 

b g n nxl . 

Theorem 5.2.2 Let ( A , B) be a controllable pair. If the input u G 1Z 1 is suf- 
ficiently rich of order n + 1, then the estimates A, B generated by (5.2.11) or 
(5.2.12) converge exponentially fast to the unknown plant parameters A,B, 
respectively. 

The proof of Theorem 5.2.2 is quite long and is presented in Section 5.6. 

An example of a sufficiently rich input u for the estimators (5.2.11), 
(5.2.12) is the input 


= ^2 Ai sin uifi 


i= 1 

for some constants Ai A 0 and LOi LOk for i k and for some integer 
m > ^± 1 . 

Example 5.2.2 Consider the second-order plant 

x = Ax + Bu 

where x = [xi,X 2 ]' , and the matrices A,B are unknown, and A is a stable matrix. 


Using (5.2.11) with A m = 
given by 

" -1 0 
0 -1 

where x = [xi,X 2 ] t and 


-1 0 
0 -1 


(x — x) + 


, the series-parallel parameter identifier is 


aii(f) 012(f) 
021 (f) 022 (f) 


h(t) 

62 (f) 


ditz — lx-i xf^xjzi i — 1,2, k — 1,2 

h = (. Xi - Xi)u, i = 1, 2 


The input u is selected as 


u = 5 sin 2.5f + 6 sin 6 . If 
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which has more frequencies than needed since it is sufficiently rich of order 4. An 
input with the least number of frequencies that is sufficiently rich for the plant 
considered is 

u = Co + sin u> 0 t, 

for some Co f 0 and to o f 0. V 

If u is a vector, i.e. , u G IZ q ,q > 1 then the following theorem may be 
used to select u. 

Theorem 5.2.3 Let (A, B) be a controllable pair. If each element Ui,i = 
1,2, ... q of u is sufficiently rich of order n + 1 and uncorrelated, i.e., each Ui 
contains different frequencies, then A(t), B(t) converge to A, B, respectively, 
exponentially fast. 

The proof of Theorem 5.2.3 is similar to that for Theorem 5.2.2, and is 
given in Section 5.6. 

The controllability of the pair ( A, B ) is critical for the results of The- 
orems 5.2.2 and 5.2.3 to hold. If ( A, B ) is not a controllable pair, then 
the elements of (A, B) that correspond to the uncontrollable part cannot be 
learned from the output response because the uncontrollable parts decay to 
zero exponentially fast and are not affected by the input u. 

The complexity of the parameter identifier may be reduced if some of 
the elements of the matrices ( A, B ) are known. In this case, the order of 
the adaptive law can be reduced to be equal to the number of the unknown 
parameters. In addition, the input u may not have to be sufficiently rich of 
order n + 1. The details of the design and analysis of such schemes are left 
as exercises for the reader and are included in the problem section. 

In the following section we extend the results of Theorem 5.2.2 to the case 
where only the output of the plant, rather than the full state, is available 
for measurement. 

5.2.3 Parameter Identifiers with Partial- State Measurements 

In this section we concentrate on the SISO plant 

x = Ax + Bu, x(0) = xo (5.2.13) 

y = C T x 



5.2. PARAMETER IDENTIFIERS 


261 


where A is a stable matrix, and y,u € 1Z 1 are the only signals available for 
measurement. 

Equation (5.2.13) may be also written as 

y = C T (si - A)~ l Bu + C T (si - A)~ l x 0 (5.2.14) 


where, because of the stability of A, et = C~ l {C T (si — ^4) _1 }xo is an expo- 
nentially decaying to zero term. We would like to design an on-line parameter 
identifier to estimate the parameters A,B,C. The triple ( A,B,C ) contains 
n 2 + 2n unknown parameters to be estimated using only input /output data. 
The I/O properties of the plant (5.2.14) at steady state (where e* = 0), how- 
ever, are uniquely determined by at most 2 n parameters. These parameters 
correspond to the coefficients of the transfer function 


= C T (sI - AY l B = 6mSm +6m - 1 4 1 + ' ' ' + 6 ° 
u(s) s n + a n -is n 1 + . . . + a 0 


(5.2.15) 


where m < n — 1. 

Because there is an infinite number of triples (A, B,C) that give the same 
transfer function (5.2.15), the triple (A,B,C) associated with the specific 
or physical state space representation in (5.3.13) cannot be determined, in 
general, from input-output data. The best we can do in this case is to 
estimate the n + m + 1 < 2 n coefficients of the plant transfer function 
(5.2.15) and try to calculate the corresponding triple (A,B,C) using some 
a priori knowledge about the structure of (A,B,C). For example, when 
(A, B, C ) is in a canonical form, (^4, B, C ) can be uniquely determined from 
the coefficients of the transfer function by using the results of Chapter 2. In 
this section, we concentrate on identifying the n + m+ 1 coefficients of the 
transfer function of the plant (5.2.13) rather than the n 2 + 2 n parameters of 
the triple (A, B, C). 

The first step in the design of the parameter identifier is to develop 
an adaptive law that generates estimates for the unknown plant parameter 
vector 

6 — [b-mi b m — i, . . . , 6o, (l n — 1 , &n— 2) • • • > > ®o] 

that contains the coefficients of the plant transfer function. As we have 
shown in Section 2.4.1, the vector 8* satisfies the plant parametric equation 


z = e* T ^ + e t 


(5.2.16) 
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where 


z 





A {s) 


«n-l(») 

A(a) V 


C T adj(s/ - A) 

aR 


Xo, 


CCi(s) 


\s , s 


A— 1 


.., 1 ] 


T 


and A (s) is an arbitrary rnonic Hurwitz polynomial of degree n. 

Using (5.2.16), we can select any one of the adaptive laws presented in 
Tables 4.2, 4.3, and 4.5 of Chapter 4 to estimate 9*. As an example, consider 
the gradient algorithm 


9 = T e<£ (5.2.17) 

e = z — z, z = 9 T cj) 


where T = T t > 0 is the adaptive gain matrix, and 9 is the estimate of 9* 
at time t. The normalizing signal m 2 in this case is chosen as m 2 = 1 due 
to (j) £ Too • 

The signal vector 0 and signal 2 may be generated by the state equation 


</>o — A c (/> 0 + lu, 

<t>i = H)<Po 

<j > 2 = A c (/> 2 - ly, 

0 = [AaJY 

z = y + A t </>2 

where fo £ K n , 0i £ ft m+1 ,0 2 £ H n 



' 1 ' 

— a t 


0 


, i = 

I n -\ o 





0 


MO) = o 
02 ( 0 ) = 0 


Po = 


o 


(m+l)x(n— m— 1) 


lm+1 1 e ^ (m+1)xn , A 


(5.2.18) 


1 

An- 2 

Ao 
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Ii is the identity matrix of dimension i x i . Oi X k is a matrix of dimension i by k 
with all elements equal to zero, and det(s/ — A c ) = A(s) = s n + A T a n _i(s). 
When m = n — 1, the matrix Pq becomes the identity matrix of dimension 
n x n. 

The state equations (5.2.18) are developed by using the identity 


(si — A c ) -1 / 


o n _ i(s) 

^r 


established in Chapter 2. 

The adaptive law (5.2.17) or any other one obtained from Tables 4.2 
and 4.3 in Chapter 4 guarantees that e, 9,z £ e,9 £ £2 H £00 and 

e(t),6(t) — > 0 as t — > 00 for any piecewise bounded input u. For these 
adaptive laws to become parameter identifiers, the input signal u has to be 
chosen to be sufficiently rich so that f is PE, which in turn guarantees that 
the estimated parameters converge to the actual ones. 

Theorem 5.2.4 Assume that the plant transfer function in (5.2.15) has no 
zero-pole cancellations. If u is sufficiently rich of order n + m + 1, then the 
adaptive law (5.2.17) or any other adaptive law from Tables f.2, f.3, and 
4-5 of Chapter 4, based on the plant parametric model (5.2.16), guarantees 
that the estimated parameter vector 9(f) converges to 9* . With the exception 
of the pure least-squares algorithm in Table £5 where the convergence is 
asymptotic, all the other adaptive laws guarantee exponential convergence of 
9 to 9*. 

Proof In Chapter 4, we have established that 9(f) converges to 9* if the signal (f> 
is PE. Therefore we are left to show that f is PE if u is sufficiently rich of order 
n + m + 1 and the transfer function has no zero-pole cancellations. 

From the definition of </>, we can write 

f = H(s)u 

and 

H (s) = [am( s )>- a n-i G ( s )] T 

[ a m( s )A(s), —af_ 1 (s)Z(s )\ T 


A (s)R(s) 


(5.2.19) 
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where G(s ) = is the transfer function of the plant. We first show by contradic- 
tion that {H(juii), H(j(jj 2 ), . . . , H{ju> n+rn+ i)} are linearly independent in C n+m+1 
for any u> 1 , . . . , u) n +m- 1-1 G R and u>i ^ u>k, i,k = 1, . . . , n + m + 1. 

Let us assume that there exist u> i,. . . ,u) n + m +i such that H(juj{), . . . ,H(juj n + m + 1 ) 
are linearly dependent, that is, there exists a vector h = [c m , c m _i , . . ., Co, 
d n -i,d n - 2 , ■ ■ • , do] T G C n+m+1 such that 

H T {ju> i) 

H T (juj 2 ) 

. h = 0 n+m+1 (5.2.20) 

H (j^n+m+l) 

where is a zero vector of dimension n + m + 1. Using the expression (5.2.19) 

for H(s), (5.2.20) can be written as 

— — r^— — - [b(ju>i)R(juJi) + a(juJi)Z(ju>i)] =0, i = 1, 2, . . . , n + m + 1 
where 

6(s) = c m s m + c m _is m_1 H h c 0 

a(s) = -d„_is n_1 - d n - 2 s n ~ 2 do 

Now consider the following polynomial: 

/(s) = a(s)Z(s) + b{s)R{s) 

Because /(s) has degree of at most m + n and it vanishes at s = ju)i, i = 1, . . . , n + 
to + 1 points, it must be identically equal to zero, that is 

a(s)Z(s) + b(s)R(s) = 0 


or equivalently 


G(s) = 


(5.2.21) 


for all s. However, because a(s) has degree at most ra — 1, (5.2.21) implies that G(s ) 
has at least one zero-pole cancellation, which contradicts the assumption of no zero- 
pole cancellation in G(s). Thus, we have proved that iL(jw,), * = 1, . . . , n + m + 1 
are linearly independent for any Ui ^ ujk, i,k = 1, . . . , n+TO+1. It then follows from 
Theorem 5.2.1 and the assumption of u being sufficiently rich of order n + m + 1 
that 4> is PE and therefore 6 converges to 6*. As shown in Chapter 4, the convergence 
of 6 to 9* is exponential for all the adaptive laws of Tables 4.2, 4.3, and 4.5 with 


the exception of the pure least-squares where the convergence is asymptotic. □ 
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An interesting question to ask at this stage is what happens to the pa- 
rameter estimates when u is sufficiently rich of order less than n + rn + 1 , 
i.e., when its spectrum S u (lo) is concentrated on k < n + m + 1 points. 

Let us try to answer this question by considering the plant parametric 
equation (5.2.16) with e* = 0, i.e., 

z = 0* T (j) = 6* t H(s)u 

where H(s) is given by (5.2.19). 

The estimated value of z is given by 

5 = d T (j) = 0 T H(s)u 


The adaptive laws of Tables 4.2, 4.3, and 4.5 of Chapter 4 based on the 
parametric equation (5.2.16) guarantee that z(t) — > z(t) as t — > oo for any 
bounded input u. If u contains frequencies at uji,U 2 , ■ ■ ■ ,uJk, then as t — > 
oo, z = 8 t H(s)u becomes equal to z = 9* T H(s)u at these frequencies. 
Therefore, as t — > oo, 6 must satisfy the equation 

Q CM. • • • ,M) 6 = Q (M. • • • ,M) 


where 


Q (jA>i , • • • , j^k) 


' H T (ju i) - 

. # t (M) . 


isafexn + m + 1 matrix with k linearly independent row vectors. Hence, 


Q(M>---.M)(0-0*) = 0 

which is satisfied for any 0 for which 0 — 6* is in the null space of Q. If 
A; < n + m + 1 it follows that the null space of Q contains points for which 
6^6*. This means that z can match z at k frequencies even when 6^6*. 

The following Theorem presented in [201] gives a similar result followed 
by a more rigorous proof. 


Theorem 5.2.5 (Partial Convergence) Assume that the plant transfer 
function in (5.2.15) has no zero-pole cancellations. If u is stationary, then 

lim RJO) (d(t) -0*) = 0 

t — .00 ^ 

where Oft) is generated by (5.2.17) or any other adaptive law from Tables 
4-2, 4-3, and 4-5 of Chapter 4 based on the plant parametric model (5.2.16). 
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The proof of Theorem 5.2.5 is given in Section 5.6. 

Theorem 5.2.5 does not imply that 9(t) converges to a constant let alone 
to 8*. It does imply, however, that 9{t) = 9(t) — 8* converges to the null 
space of the autocovariance R$( 0) of 0, which depends on H(s) and the 
spectrum of u. In fact it can be shown [201] that R ( p( 0) is related to the 
spectrum of u via the equation 

k 

R<t>{ 0) = Y, H (-M)H T (jvi)Su(vi) 

1=1 

which indicates the dependence of the null space of R ( p( 0) on the spectrum 
of the input u. 


Example 5.2.3 Consider the second order plant 


V = 


bo 

s 2 + ais + ao 


where ai,ao > 0 and b 0 yf 0 are the unknown plant parameters. We first express 
the plant in the form of 

^ = e * T ( t > 


where 9* = [6 0 , an, a 0 ] T , 2 = <£ = 


1 [s,l] 

A(A u ’~*is) y 


and choose A(s) = (s + 2) 2 


Let us choose the pure least-squares algorithm from Table 4.3, i.e., 


9 = Pe<j), 9(0) = 9 0 
P = -P^P, P(0) = p 0 I 

e = z — 9 t 4> 


where 9 is the estimate of 9* and select po = 50. The signal vector cf> = [<pj ,(j)J] T 
is generated by the state equations 

4> o = A c </>o + In 
(j>i = [0 1]4> 0 

(\>2 = A c </>2 - ly 

z = y+[ 4 4] cj >2 


where A r 


1 —4 
1 0 


,1 = 


The reader can demonstrate via computer 

simulations that: (i) for u = 5sint + 12sin3f, 9(t) — > 9* as t — > oo; (ii) for u = 
12 sin 3f , 9(t) — > 9 as t — » oo where 9 is a constant vector that depends on the initial 
condition 0(0). V 
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Remark 5.2.2 As illustrated with Example 5.2.3, one can choose any one 
of the adaptive laws presented in Tables 4.2, 4.3, and 4.5 to form pa- 
rameter identifiers. The complete proof of the stability properties of 
such parameter identifiers follows directly from the results of Chap- 
ter 4 and Theorem 5.2.4. The reader is asked to repeat some of the 
stability proofs of parameter identifiers with different adaptive laws in 
the problem section. 


5.3 Adaptive Observers 

Consider the LTI SISO plant 

x = Ax + Bu, x(0) = xo (5.3.1) 

y = C T x 

where x £ IZ n . We assume that u is a piecewise continuous and bounded 
function of time, and A is a stable matrix. In addition we assume that the 
plant is completely controllable and completely observable. 

The problem is to construct a scheme that estimates both the parame- 
ters of the plant, i.e., A,B,C as well as the state vector x using only I/O 
measurements. We refer to such a scheme as the adaptive observer. 

A good starting point for choosing the structure of the adaptive observer 
is the state observer, known as the Luenberger observer, used to estimate the 
state vector x of the plant (5.3.1) when the parameters A , B , C are known. 

5.3.1 The Luenberger Observer 

If the initial state vector xo is known, the estimate x of x in (5.3.1) may be 
generated by the state observer 


x = Ax + Bu, x(0) = xo (5.3.2) 

where x £ TZ n . Equations (5.3.1) and (5.3.2) imply that x(t) = x(t),Vf > 0. 
When xo is unknown and A is a stable matrix, the following state observer 
may be used to generate the estimate x of x: 


x = Ax + Bu, x(0) = xq 


(5.3.3) 
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In this case, the state observation error x = x — x satisfies the equation 

x = Ax, 'ti(O) = Xq — Xq 

which implies that x(t) = e At x(0). Because A is a stable matrix x(t) — ► 0, 
i.e. , x(t) — > x(t) as t — > oo exponentially fast with a rate that depends on 
the location of the eigenvalues of A. The observers (5.3.2), (5.3.3) contain 
no feedback terms and are often referred to as open-loop observers. 

When xq is unknown and A is not a stable matrix, or A is stable but 
the state observation error is required to converge to zero faster than the 
rate with which || e At || goes to zero, the following observer, known as the 
Luenberger observer, is used: 

x = Ax + Bu + K(y — y), x(0) = xq (5.3.4) 

y = C T x 

where K is a matrix to be chosen by the designer. In contrast to (5.3.2) and 
(5.3.3), the Luenberger observer (5.3.4) has a feedback term that depends 
on the output observation error y = y — y. 

The state observation error x = x — x for (5.3.4) satisfies 

x = {A — KC t )x, x(0) = xq — xq (5.3.5) 

Because (C, A) is an observable pair, we can choose K so that A — KC T is 
a stable matrix. In fact, the eigenvalues of A — KC', and, therefore, the 
rate of convergence of x(t) to zero can be arbitrarily chosen by designing K 
appropriately [95]. Therefore, it follows from (5.3.5) that x(t) — > x(t) expo- 
nentially fast as t — > oo, with a rate that depends on the matrix A — KC T . 
This result is valid for any matrix A and any initial condition xq as long as 
{C, A) is an observable pair and A, C are known. 


Example 5.3.1 Consider the plant described by 

' -4 1 1 I" 1 ' 

* = [ —4 oJ X+ [3j M 

y = [i,o] X 

The Luenberger observer for estimating the state x is given by 
* = [-4 0 ] [ 3 ] 

y = [i,o]x 
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-4 -k t 1 ' 
-4 - k 2 0 


is a stable matrix. Let us assume that x(t) is required to converge to x(t) faster 
than e _5t . This requirement is achieved by choosing ki,k 2 so that the eigenvalues 
of Aq are real and less than —5, i.e., we choose the desired eigenvalues of Aq to be 
Ai = —6, A 2 = —8 and design k\ , k 2 so that 

det (si — Ap) = s 2 + (4 + k\)s + 4 + ^2 = (s + 6) (s + 8) 


which gives 


k-, = 10, k 2 = 44 


V 


5.3.2 The Adaptive Luenberger Observer 

Let us now consider the problem where both the state x and parameters 
A, B, C are to be estimated on-line simultaneously using an adaptive ob- 
server. 

A straightforward procedure for choosing the structure of the adaptive 
observer is to use the same equation as the Luenberger observer in (5.3.4), 
but replace the unknown parameters A,B,C with their estimates A,B,C, 
respectively, generated by some adaptive law. The problem we face with 
this procedure is the inability to estimate uniquely the n 2 + 2 n parameters 
of A, B, C from input/output data. As explained in Section 5.2.3, the best we 
can do in this case is to estimate the n + m+1 < 2 n parameters of the plant 
transfer function and use them to calculate A, B , C. These calculations, 
however, are not always possible because the mapping of the 2 n estimated 
parameters of the transfer function to the n 2 + 2 n parameters of A, 13 , C 
is not unique unless (A,B,C) satisfies certain structural constraints. One 
such constraint is that ( A,B,C ) is in the observer form, i.e., the plant is 
represented as 

In— 1 

x a = _ x a + b p u (5.3.6) 

: 0 

y = [ 10 ... 0] x a 
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where a p = [a n _i, a n - 2, • • • ao] T and b p = [6 n _ 1, b n - 2, ... 60] T are vectors of 
dimension n and I n ~i G 7^.( n-1 ) x ( n-1 ) is the identity matrix. The elements of 
a p and b p are the coefficients of the denominator and numerator, respectively, 
of the transfer function 

y{s) 6 n -is n_1 + b n - 2 s n ~ 2 + • • • + b 0 , ^ 

i (5.3.7) 

u(s) s n + a n -is n 1 + ...a 0 

and can be estimated on-line from input/output data by using the techniques 
described in Chapter 4. 

Because both (5.3.1) and (5.3.6) represent the same plant, we can assume 
the plant representation (5.3.6) and estimate x a instead of x. The disadvan- 
tage is that in a practical situation x may represent physical variables that 
are of interest, whereas x a may be an artificial state variable. 

The adaptive observer for estimating the state x a of (5.3.6) is motivated 
from the structure of the Luenberger observer (5.3.4) and is given by 

x = A(t)x + b p (t)u + K(t)(y — y) (5.3.8) 

y = [10... 0]x 

where x is the estimate of x a , 

In— 1 

A(t) = — a p (t) : ... > A (f ) = a — a p (t ) 

: 0 

a* G TZ n is chosen so that 

In— 1 

A*= —a* : ... (5-3.9) 

: 0 

is a stable matrix and a p (t) and b p (t) are the estimates of the vectors a p and 
b p , respectively, at time t. 

A wide class of adaptive laws may be used to generate a p (t) and b p (t) 
on-line. As an example, we can start with (5.3.7) to obtain as in Section 
2.4.1 the parametric model 


z = 6* T <j> 


(5.3.10) 
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where 

<t> 

z 

A (s) 

and 

9 — [bn— 1 j b n — 2 , . . . , On— 1 1 ®n— 2i ■ • • i ®o] 

is the parameter vector to be estimated and A(s) is a Hurwitz polynomial 
of degree n chosen by the designer. A state-space representation for (f> and z 
may be obtained as in (5.2.18) by using the identity (si — Ac) -1 / = 
where (A c , /) is in the controller canonical form and det(sl — A c ) = A(s). 

In view of (5.3.10), we can choose any adaptive law from Tables 4.2, 
4.3 and 4.5 of Chapter 4 to estimate 9* and, therefore, a p , b p on-line. We 
can form a wide class of adaptive observers by combining (5.3.8) with any 
adaptive law from Tables 4.2, 4.3 and 4.5 of Chapter 4 that is based on the 
parametric plant model (5.3.10). 

We illustrate the design of such adaptive observer by using the gradient 
algorithm of Table 4.2 (A) in Chapter 4 as the adaptive law. The main 
equations of the observer are summarized in Table 5.1. 

The stability properties of the class of adaptive observers formed by 
combining the observer equation (5.3.8) with an adaptive law from Tables 
4.2 and 4.3 of Chapter 4 are given by the following theorem. 

Theorem 5.3.1 An adaptive observer for the plant (5.3.6) formed by com- 
bining the observer equation (5.3.8) and any adaptive law based on the plant 
parametric model (5.3.10) obtained from Tables f.2 and f.3 of Chapter 4 
guarantees that 

(i) All signals are u.b. 

(ii) The output observation error y = y — y converges to zero as t — > oo. 

(iii) If u is sufficiently rich of order 2 n, then the state observation error 
x = x a — x and parameter error 8 = 9 — 9* converge to zero. The rate 
of convergence is exponential for all adaptive laws except for the pure 
least-squares where the convergence is asymptotic. 


oj-l _Q )„ On-lW 
A (s 
s" 




-i T 


= 

' ' 

■o- 

(O-l 


lT 


-y = y + \ T <f >2 


A (s) 

s 11 + A T a n _i(s) 
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Table 5.1 Adaptive observer with gradient algorithm 


Plant 

• In— 1 

x a = _ a p : ... x a + b p u, x a £ IZ n 

L : 0 J 

y = [ 1 , 0 ... o]x a 

Observer 

• In— 1 

X= —a p (t) : ... x+b p {t)u+{a*-a p {t)){y-y) 

[ : 0 J 

y = [1 0 . . . 0 \x 

Adaptive law 

6 = I 

e = 

z = t 

(j) = 

z = - 

"z<t> 

bJ{t),aJ{t)V , e = ^S, 
r = r t > 0 

«,T_i(s) 1 T 

A (s) A(s) y 

„n 

Design variables 

a* is chosen so that A* in (5.3.9) is stable; m 2 = 

1 or m ? = 1 + A(s) is a rnonic Hurwitz poly- 

nomial of degree n 


Proof (i) The adaptive laws of Tables 4.2 and 4.3 of Chapter 4 guarantee that 
e, em, 9 £ £ 2 n£ oc and 9 £ £oo independent of the boundedness of u, y. Because u € 
£oo and the plant is stable, we have x a , y , </>, m € £oo ■ Because of the boundedness 
of y, 4> we can also establish that e, em, 9 — > 0 as t — * oo by showing that e £ £,*> 
(which follows from 9, </> £ £ 00 ), which, together with e £ £ 2 , implies that e — > 0 as 
t — > 00 (see Lemma 3.2.5). Because m,(f> £ £<*,, the convergence of em, 9 to zero 
follows. 

The proof of (i) is complete if we establish that x £ £oo ■ We rewrite the observer 
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equation (5.3.8) in the form 

x = A* x + b p (t)u + ( A(t ) — A*)x 0 


(5.3.11) 


Because 9 = 




,u,x a € Coo and A* is a stable matrix, it follows that 
x € Coo- Hence, the proof of (i) is complete. 

(ii) Let x = x a — 5 be the state observation error. It follows from (5.3.11), 
(5.3.6) that 

x = A*x — b p u + a p y , 5(0) = a; a (0) — 5(0) (5.3.12) 

where b p = b p — b p , a p = a p — a p . From (5.3.12), we obtain 

y = C T x(s ) = C T (sI - A*) _1 (-& p u + a p y ) + e t 

where e* = C {C T (s/ — A*) -1 } 5(0) is an exponentially decaying to zero term. 
Because (C. A) is in the observer canonical form, we have 


C T (. si- A *)" 1 = 


I-i(s) [s n_1 , s n_2 , . . . s, 1] 


det {si - A*) det(sl — A*) 


A 


Letting A*(s) = det (si — A*), we have 

1 n 

y(s) = — ^ V s n ~ 
’ A *(s) 

x ' i = 1 


-b n -iU + a n -iy 


e t 


where bi,cii is the ith element of b p and a p , respectively, which may be written as 

A(s) s" 




Up CtJ-lVJ. Ujp 

c-n—i r 


A *(s) A(s) L 

v 7 Z— 1 v 7 


—b n -iU + a n -ty 


£t 


(5.3.13) 


where A (s) is the Hurwitz polynomial of degree n defined in (5.3.10). We now apply 
Lemma A.l (see Appendix A) for each term under the summation in (5.3.13) to 
obtain 


V = 


A (a) 

A *(s) z 


E 




s n-i s n ~ l 

bn—i . 7 \ ^ A Cln—i . 7 7 y 

A(s) A(s) 


-W ci (s) (W bi (s)u) b n -i + W cl (s) (Wbi(s)y) b n -i\ + e t (5.3.14) 

where the elements of W C i(s ), W b i(s ) are strictly proper transfer functions with the 
same poles as A(s). Using the definition of <t> and parameter error 9 = 9 — 9*, we 
rewrite (5.3.14) as 


V = 


A (s) 

A *(s) 


-9 t 4> + y W C i(s) (- ( W bi (s)u ) b n -i + (W bi (s)y) a n -^j J> + e t 


i= 1 
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Figure 5.1 General structure of the adaptive Luenberger observer. 

Because b n -i,d n —i £ £ 2 H C 00 converge to zero as t — > 00, u,y £ £00 and the 
elements of W ci (s),Wbi(s) are strictly proper stable transfer functions, it follows 
from Corollary 3 . 3.1 that all the terms under the summation are in £2 H £00 and 
converge to zero as t — > 00. Furthermore, from em 2 = z — z = — 0 T (j), m £ £ oo, e £ 
£2 H £00 and e(t) — ■> 0 as t — > 0 , we have that 6 T <j> £ £ 2 fl £ ^ converges to zero as 
t — > 00. Hence, y is the sum of an output of a proper stable transfer function whose 
input is in £ 2 and converges to zero as t — > 00 and the exponentially decaying to 
zero term e t . Therefore, y(t) — » 0 as t — > 00. 

(iii) If is PE, then we can establish, using the results of Chapter 4 , that 
a p (t), b p (t) converge to zero. Hence, the input —b p u + a p y converges to zero, which, 
together with the stability of A*, implies that x(t) — > 0 . With the exception of the 
pure least-squares, all the other adaptive laws guarantee that the convergence of 
b p , d p to zero is exponential, which implies that x also goes to zero exponentially 
fast . 

The PE property of 4 > is established by using exactly the same steps as in the 
proof of Theorem 5 . 2 . 4 . □ 


The general structure of the adaptive observer is shown in Figure 5.1. 

The only a priori knowledge assumed about the plant (5.3.1) is that it is 
completely observable and completely controllable and its order n is known. 
The knowledge of n is used to choose the order of the observer, whereas 
the observability of (C, A) is used to guarantee the existence of the state 
space representation of the plant in the observer form that in turn enables 
us to design a stable adaptive observer. The controllability of (A, B) is not 
needed for stability, but it is used together with the observability of (C, A) 
to establish that 4> is PE from the properties of the input u. 
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Theorem 5.3.1 shows that for the state x a of the plant to be estimated 
exactly, the input has to be sufficiently rich of order 2 n, which implies that 
the adaptive law has to be a parameter identifier. Even with the knowledge of 
the parameters a p , b p and of the state x a , however, it is not in general possible 
to calculate the original state of the plant x because of the usual nonunique 
mapping from the coefficients of the transfer function to the parameters of 
the state space representation. 


Example 5.3.2 Let us consider the second order plant 


— ai 1 

x + 

' h ' 

-a 0 0 _ 

. & o . 

[1,0] x 




where Oo, bi, bo are the unknown parameters and u, y are the only signals avail- 
able for measurement. 

Using Table 5.1, the adaptive observer for estimating x , and the unknown pa- 
rameters are described as follows: The observer equation is given by 


x 

y 


-Oi(t) 1 
-ao(t) 0 

[ 1 , 0 ]* 


' foi(f) 


u + 


9 — ai(t) 

20 — ao(t) 


(y-y) 


where the constants a* = 9,aJ = 20 are selected so that A* 

eigenvalues at Ai = —5, A 2 = —4. 

The adaptive law is designed by first selecting 


-a\ 1 

— a o 0 


has 


A(s) = (s + 2 )(s + 3) = s 2 + 5s + 6 


and generating the information vector 0 = [tfij , 0j] T 


<Ai 
4> 2 


' -5 -6 ' 


' 1 ' 



1 0 

01 + 

0 

U 

' -5 -6 ' 


' -1 

- 


1 0 

02 + 

0 


y 


and the signals 


2 

z 


y + [5, 6] 


bi,h 


0i + [ai,a 0 ] 02 


e 


z — z 
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The adaptive law is then given by 


k 

k 




ai 

«2 


72^2 


The adaptive gains 71 , 72 > 0 are usually chosen by trial and error using simulations 
in order to achieve a good rate of convergence. Small 71,72 may result in slow 
convergent rate whereas large 71,72 may make the differential equations “stiff” and 
difficult to solve numerically on a digital computer. 

In order for the parameters to converge to their true values, the plant input u 
is chosen to be sufficiently rich of order 4. One possible choice for u is 


u = A\ sin LU\t + A2 sin u>2t 


for some constants Ai,A 2 fO and u> 1 f U2- 


V 


5.3.3 Hybrid Adaptive Luenberger Observer 

The adaptive law for the adaptive observer presented in Table 5.1 can be 
replaced with a hybrid one without changing the stability properties of the 
observer in any significant way. The hybrid adaptive law updates the param- 
eter estimates only at specific instants of time t^, where t^ is an unbounded 
monotonic sequence, and tk+i — tk = T s where T s may be considered as the 
sampling period. 

The hybrid adaptive observer is developed by replacing the continuous- 
time adaptive law with the hybrid one given in Table 4.5 as shown in Table 
5.2. The combination of the discrete-time adaptive law with the continuous- 
time observer equation makes the overall system more difficult to analyze. 
The stability properties of the hybrid adaptive observer, however, are very 
similar to those of the continuous-time adaptive observer and are given by 
the following theorem. 

Theorem 5.3.2 The hybrid adaptive Luenberger observer presented in Ta- 
ble 5.2 guarantees that 

(i) All signals are u.b. 

(ii) The output observation error y = y — y converges to zero as t — > 00. 

(iii) Ifu is sufficiently rich of order 2 n, then the state observation error x = 
x a — x and parameter error 6k = 6k ~ @* converge to zero exponentially 
fast. 
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Table 5.2 Hybrid adaptive Luenberger observer 


Plant 

x a = 

y = [1 

In— 1 

dp! . . . 

: 0 
, 0 . . . 0]it 

x a + b p u, x a G Tt n 

Observer 

x = 

y = [ 1 

x&R 

• In— 1 

-Ofc : 

: 0 
,0, . . .,0]f 
l n ,te[t k ,t k+ i). 

x+ b k u+ ( a*-a k )(y-y ) 

, As = 0, 1, . . . 

Hybrid 
adaptive law 

9 k+l = e k + r jJ +1 e(r)^(r)dr 

z(t) = 0j(/>(t), Vt€[t k ,t k+ 1) 
m 2 = 1 + a(j) T cf, a > 0 

, 0:3" 1(5) aj (s) e n „ a t 1 ^ 

0=1 ms) u - a(s) yj >*=4iy* , ’ 0fc= rfc’°fcJ 

Design variable 

a* is chosen so that H* in (5.3.9) is stable; A(s) is 
rnonic Hurwitz of degree n; T s = t k+ \ — t k is the 
sampling period; T = T t > 0, T s are chosen so that 

2 - T s X max (F) > 70 for some 70 > 0 


To prove Theorem 5.3.2 we need the following lemma: 

Lemma 5.3.1 Consider any piecewise constant function defined as 

f(t) = f k Vt G [kT s , (k + 1)T S ), k = 0,1, 

If the sequence {A/*.} G i 2 where Af k = / fc+1 — f k , then there exists a 
continuous function f(t ) such that \f — f\ G £2 and f € £ 2 - 
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Proof The proof of Lemma 5.3.1 is constructive and rather simple. Consider the 
following linear interpolation: 

f(t) = fk+ fk+1 ~ fk (: t - kT s ) Vi G [kT s , (k + 1 )T S ) 

s 


It is obvious that / has the following properties: (i) it is continuous; (ii) | /(f) — 
fit) | < |/fc + i - f k |; (hi) / is piecewise continuous and \f(t)\ = ^\fk+i - fk\ 
Vi G [/tTg, (fc + 1)T S ). Therefore, using the assumption A fk G I 2 we have 


and 



\.m-f(t)\ 2 dt<T s j2\\fk\ 2 

k = 0 


< 


OO 



\ 'f(t)\ 2 dt < \ A fk\ 2 < 

k = 0 


OO 


he., / G C- 2 - 


□ 


Proof of Theorem 5.3.2 

(i) We have shown in Chapter 4 that the hybrid adaptive law given in Table 4.5 
guarantees that Ok G loo', G tm G Coo f) £ 2 - Because u G Too and the plant is stable, 
we have y, x a , cj),m G Too- As we show in the proof of Theorem 5.3.1, we can write 
the observer equation in the form 

x = A*x+ b k u+ (A k - A*)x a , Vte[t k ,t k+ 1 ) (5.3.15) 


where A* is a stable matrix and 



In— 1 


0 


Because a k ,bk G loo', x a ,u G Coo, it follows from (5.3.15) that x G Coo, and, 
therefore, all signals are u.b. 

(ii) Following the same procedure as in the proof of Theorem 5.3.1, we can 
express y = y — y as 


y = C T (sI-A*) 1 {-b k u + a k y) + e t (5.3.16) 


where e* is an exponentially decaying to zero term. 

From Lemma 5.3.1 and the properties of the hybrid adaptive law, i.e., 0 k G l 0 0 
and \0k+i — Ok \ G £ 2 , we conclude that there exists a continuous piecewise vector 
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function 9 such that \9{t)— 9 k {t)\, 9 G Coo f] £ 2 , where 9k{t) is the piecewise constant 
function defined by 9 k {t) = 9k, Vt G [tk,tk+ 1 ). Therefore, we can write (5.3.16) as 

y = C T (sI - A*)~ 1 (-bu + ay) + f(t) + e t (5.3.17) 

where 

/(f) = C T (sI - A*) -1 (( b - b k )u - (a - a fc )y) 

Using Corollary 3.3.1, it follows from u,y G Coo and 1 9 — 9 k \ G Uoofl^a that 
/ € Coo D £2 and f(t) — > 0 as t — ■> 00 . 

Because now 9 has the same properties as those used in the continuous adaptive 
Luenberger observer, we can follow exactly the same procedure as in the proof 
of Theorem 5.3.1 to shown that the first term in (5.3.17) converges to zero and, 
therefore, y(t) — > 0 as t — > 00 . 

(iii) We have established in the proof of Theorem 5.2.4 that when u is sufficiently 
rich of order 2 n, <f> is PE. Using the PE property of <f> and Theorem 4.6.1 (iii) we 
have that 9 k — ► 9* exponentially fast. □ 


5.4 Adaptive Observer with Auxiliary Input 

Another class of adaptive observers that attracted considerable interest [28, 
123, 172] involves the use of auxiliary signals in the observer equation. An 
observer that belongs to this class is described by the equation 

x = A(t)x + b p (t)u + K(t)(y — y) + v, &(0) = x 0 
y = [10...0]x (5.4.1) 

where x, A(t ), b p (t), K(t ) are as defined in (5.3.8), and v is an auxiliary vector 
input to be designed. The motivation for introducing v is to be able to 
use the SPR-Lyapunov design approach to generate the adaptive laws for 
A(t), b p (t),K(t). This approach is different from the one taken in Section 
5.3 where the adaptive law is developed independently without considering 
the observer structure. 

The first step in the SPR-Lyapunov design approach is to obtain an error 
equation that relates the estimation or observation error with the parameter 
error as follows: By using the same steps as in the proof of Theorem 5.3.1, 
the state error x = x a — x satisfies 

x = A*x + a p y — b p u — v, 5(0) = xq 
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y = C T x (5.4.2) 

where a p , b p are the parameter errors. Equation (5.4.2) is not in the fa- 
miliar form studied in Chapter 4 that allows us to choose an appropriate 
Lyapunov-like function for designing an adaptive law and proving stability. 
The purpose of the signal vector v is to convert (5.4.2) into a form that is 
suitable for applying the Lyapunov approach studied in Chapter 4. The fol- 
lowing Lemma establishes the existence of a vector v that converts the error 
equation (5.4.2) into one that is suitable for applying the SPR-Lyapunov 
design approach. 


Lemma 5.4.1 There exists a signal vector v G lZ n , generated from mea- 
surements of known signals, for which the system (5. 4-2) becomes 

e = A*e + B C (-6 T f>), e(0) = xo 

y = C T e (5.4.3) 

where C T (si — A*)~ 1 B C is SPR, cf> G 1Z 2n is a signal vector generated from 
input/output data, 6 = [bJ,aJ\ T , and e G lZ n is a new state vector. 


Proof Because (C, A*) is in the observer form, we have 


C t (sI-A *)~ 1 


oj-i(s) 

A *(s) 


where a„_i(s) = [s n 1 , s n 2 , . . . , 1] T , A*(s) = det(sl — A*). Therefore, (5.4.2) can 
be expressed in the form 

y = C T (sI-A *)~ 1 a p y — b p u — v + e t 



<*n-l ( S ) \ 

A(s) 7 


+ et 


(5.4.4) 


where e t = C 1 |C T (s/ — A*) 1 } x 0 , A (s) = s n 1 + A T a„_ 2 (s) is a Hurwitz poly- 
nomial, A T = [A„_ 2 , A„_ 3 , . . . , Ai, Ao] is to be specified and a,, bi is the ith element 
of d p , b p respectively. 

Applying the Swapping Lemma A.l given in Appendix A to each term under 
the summation in (5.4.4), we obtain 
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We now use Lemma A.l to obtain expressions for W c i , Wm in terms of the param- 
eters of A(s). Because W c i, W b \ are the transfer functions resulting from swapping 
with the transfer function 

£4 = 1- °%T )A = 1 - C o ( sI ~ A o)-'A = 1 + C (si - A 0 )- 1 (-A) 

A(s) A(s) 

where C 0 = [1, 0, ... , 0] T € ft 11 " 1 


Aq — 


-A 


In - 2 


g 7 £(ra-l)x(ra-l) 


0 


it follows from Lemma A.l that 

W c i(s) = —Cj (si - Ao)' 1 = ~ a . n 7 2 . (s) , W bl (s) = (si - A 0 )- 1 (-A) 

A (s) 

Similarly W ci , H'l,.; i = 2, 3, . . . , n result from swapping with 


x ^=Cj(sI~A 0 )- 1 d l 

where di = [0, . . . , 0, 1, 0, . . . , 0] T £ R n ~ 1 has all its elements equal to zero except 
for the (i — 1) — th element that is equal to one. Therefore it follows from Lemma A. 1 
that 


W C i(s) = —Cq (si — A 0 ) _1 = — - 


£-200 
A(s) ' 


i = 2, 3, . . . , n 


W bi (s) = (si- A 0 ) 1 d i , i = 2,3, ... ,5 


If we define 


A 


v 


A (a) ’ A(s) 

and use the expressions for W C i(s), W b i(s), (5.4.5) becomes 


V = 


A (s) 

A *(s) 

o£- 2 ( S ) 

A (S) 


—Q'ty — 


A ( 3 ) 


Vi 


n 

55 {[( s/_ A 0 ) ^diy] d n _i — [(si- A 0 )~ 1 d i u] &„_*}-! h 


where v is partitioned as v = [i>i,i> T ] T with v\ £ 1Z 1 and v £ 7 Z n 2 and d\ = —A. 
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Choosing v± = 0 and 

n 

V = | [(si - A 0 )~ 1 d i y] a n -i - [(si - A 0 ) -1 djii] 6„_i| 

i = 1 

we obtain 

V = ^{s)^ T ^ + et ( 5A6 ) 

Because 

aw = »-■ + aw„_ 2 w = = c t (s7 _ A . r t b 

A *(s) A *(s) A *(s) K ’ 

where b x = [1, A T ] T G 7 Z n and C = [1,0,..., 0] T G TZ n . For B c = b x , (5.4.3) 
is a minimal state-space representation of (5.4.6). Because A(s) is arbitrary, its 
coefficient vector A can be chosen so that is SPR. The signal v is implementable 

because a,i,bi and u, y are available for measurement. □ 

We can now use (5.4.3) instead of (5.4.2) to develop an adaptive law for 
generating 6 = aj T . Using the results of Chapter 4, it follows that the 
adaptive law is given by 

6 = Te(j), e = y = y-y (5.4.7) 

where T = T t > 0. 

We summarize the main equations of the adaptive observer developed 
above in Table 5.3. The structure of the adaptive observer is shown in 
Figure 5.2. 

Theorem 5.4.1 The adaptive observer presented in Table 5.3 guarantees 
that for any bounded input signal u, 

(i) all signals are u.b. 

(ii) y(t) = y - y -> 0 as t oo. 

(iii) d p (t), b p (t) G £2 C L ^ and converge to zero as t — > 00. 

In addition, if u is sufficiently rich of order 2n, then 

(iv) x(t) = x a (t) — x(t), d p (t ) = a p (t) — a p , b p (t ) = b p (t) — b p converge to 
zero exponentially fast. 
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Table 5.3 Adaptive observer with auxiliary input 
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Figure 5.2 Structure of the adaptive observer with auxiliary input signal. 


Proof The main equations that describe the stability properties of the adaptive 
observer are the error equation (5.4.3) that relates the parameter error with the 
state observation error and the adaptive law (5.4.7). Equations (5.4.3) and (5.4.7) 
are analyzed in Chapter 4 where it has been shown that e, y £ £2 n£oo and 0 £ Coo 
for any signal vector <j> with piecewise continuous elements. Because u, y £ Coo, it 
follows that (f> £ Coo and from (5.4.3) and (5.4.7) that 9,e,y € Coo and 0 £ /12- 
Using e,y £ £2 D £<x> together with e,y £ Coo, we have y(t) — > 0, e{t) — > 0 as 
t — ► 00 , which implies that 9(t) — > 0 as t — > 00 . From 6 £ £ 2 D Coo, it follows that 
v £ £2 n£oo and v(t) — > 0 as t — > 00 . Hence, all inputs to the state equation (5.4.2) 
are in Coo, which implies that x £ Coo, he., x £ Coo- 

In Chapter 4 we established that ii </>,</> £ Coo and 4> is PE, then the error 
equations (5.4.3) and (5.4.7) guarantee that 9(t) — > 0 as t — » 00 exponentially fast. 
In our case (t>,(f> £ Coo and therefore if u is chosen so that (f> is PE then 9(t) — > 0 as 
t — > 00 exponentially fast, which implies that e{t),y(t), 9(t), v (t), a p [t)y, b p (t)u and, 
therefore, x(t) converge to zero exponentially fast. To explore the PE property of 
4>, we note that <f> is related to u through the equation 

Oi n — 1 (s) 

-a n ~i(s)C T {si - A)- 1 ^ \ U 

Because u is sufficiently rich of order 2 n, the PE property of <t> can be established 
by following exactly the same steps as in the proof of Theorem 5.2.4. □ 



The auxiliary signal vector v can be generated directly from the signals 
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<j) and 6, i.e., the filters Wj for y,u do not have to be implemented. This 
simplification reduces the number of integrators required to generate cj), v 
considerably and it follows from the relationship 

{si - A 0 y 1 d i = Qi a 'T^ , i= 1 , 2 , . . . , n 
A{s) 

where Qi € 7^.( n_1 ) x ( n_1 ) are constant matrices whose elements depend on 
the coefficients of the numerator polynomials of {si — Ao)" 1 ^ (see Problem 
2 . 12 ). 

As with the adaptive Luenberger observer, the adaptive observer with 
auxiliary input shown in Table 5.3 requires the input u to be sufficiently 
rich of order 2 n in order to guarantee exact plant state observation. The 
only difference between the two observers is that the adaptive Luenberger 
observer may employ any one of the adaptive laws given in Tables 4.2, 4.3, 
and 4.5 whereas the one with the auxiliary input given in Table 5.3 relies 
on the SPR-Lyapunov design approach only. It can be shown, however, (see 
Problem 5.16) by modifying the proof of Lemma 5.4.1 that the observation 
error y may be expressed in the form 

y = -6 T (j) (5.4.8) 

by properly selecting the auxiliary input v and <j). Equation (5.4.8) is in the 
form of the error equation that appears in the case of the linear parametric 
model y = 0* T (j) and allows the use of any one of the adaptive laws of 
Table 4.2, 4.3, and 4.5 leading to a wide class of adaptive observers with 
auxiliary input. 

The following example illustrates the design of an adaptive observer with 
auxiliary input v and the generation of v from the signals 0, 6. 

Example 5.4.1 Let us consider the second order plant 



— ai 

1 ' 


' h ' 

Xq, 

-a 0 

0 

Xq, + 

. b ° . 

y = [ 

1 0] x a 





where a\ > 0,ao > 0,bi,bo are unknown constants and y,u are the only available 
signals for measurement. 
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We use Table 5.3 to develop the adaptive observer for estimating x a and the un- 
known plant parameters. We start with the design variables. We choose a* = [4 , 4] T , 
A = Ao, A(s) = s + Aq. Setting Ao = 3 we have 


[1 


0][sl — A*) 



S + 3 

(5 + 2) 2 


which is SPR. 

The signal vector </> 


— rnU, —thU. r^y, 

s+3 ’ s+3 ’ s+3^’ 




nT 


is realized as follows 


0 — [01> 02j 03i 04] 


where 

0 1 — u — 3^i, 0i — — 30i T u 

02 = 01 

03 = -y + 303, 03 = -303 + y 

04 = 03 

with 0i (0) = 0 , 03 ( 0 ) = 0. 

For simplicity we choose the adaptive gain T = diag{10, 10, 10, 10}. The adap- 
tive law is given by 

Si = 100! (y - y), b 0 = lO0 2 (y y) 

Si = 1003 (j/ — y)i a 0 = lO0 4 (y - y) 

0 


and the signal vector v = 
3 


by 


s T 3 


y a 1 


s T 3 


u 0 \ 


&i - 


1 


s T 3 


2/ ao + 


s T 3 


U Oq 


— 3ai03 — 3Si0i — ao03 + S o 0i 

= 10(y - y) (-30304 - 30102 + 04 + 02 ) 


The observer equation becomes 


x = 


-ai 

1 ' 


' Si ' 

u + 

\ 4-ai 1 

~a 0 

0 

x + 

. b 0 . 

1 

i O 

1 

i 


(y - y) 


o 

10 


(y - y) [-30304 - 30102 


The input signal is chosen as u = Ai sinwif + sin u> 2 t for some Ai, A 2 ^ 0 and 

Wl ^ U} 2 - V 
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5.5 Adaptive Observers for Nonminimal Plant 
Models 

The adaptive observers presented in Sections 5.3 and 5.4 are suitable for 
estimating the states of a minimal state space realization of the plant that 
is expressed in the observer form. Simpler (in terms of the number of inte- 
grators required for implementation) adaptive observers may be constructed 
if the objective is to estimate the states of certain nonminimal state-space 
representations of the plant. Several such adaptive observers have been pre- 
sented in the literature over the years [103, 108, 120, 123, 130, 172], in this 
section we present only those that are based on the two nonminimal plant 
representations developed in Chapter 2 and shown in Figures 2.2 and 2.3. 

5.5.1 Adaptive Observer Based on Realization 1 

Following the plant parameterization shown in Figure 2.2, the plant (5.3.1) 
is represented in the state space form 


01 

= A c 0i + lu, 

0i (0) = 0 

02 

= A c 02 - ly, 

02(0) = 0 

L b 

= A c uj, w(0) 

= Wo = B 0 x 0 

Vo 

= C 0 T o; 


z 

= y + A t 0 2 = 

r T 0 + r/ o 


y = 0* T (j) - A T 0 2 + ??o 

where w £ TZ n , 0 = [(j)J , 02~] T , 0* e Tv!” , * = 1, 2; A c € 7 Z nxn is a known stable 
matrix in the controller form; l = [1, 0, . . . , 0] T £ lZ n is a known vector such 
that (si — A c )~ l l = a X(sj S ^ an d A( s ) = det(sl — A c ) = s n + A T a n _i(s), 
A = [A n _i, . . . , A 0 ] t ; 0* = [b n -i,b n - 2 , ■ ■ ■ , bo, a n -i, o n _ 2 , • • • ,ao] T £ E? n are 
the unknown parameters to be estimated; and Bq £ 7 Z nxn is a constant 
matrix defined in Section 2.4. 

The plant parameterization (5.5.1) is of order 3 n and has 2 n unknown 
parameters. The state ui and signal r/o decay to zero exponentially fast with 
a rate that depends on A c . Because A c is arbitrary, it can be chosen so that 
r/o, w go to zero faster than a certain given rate. Because ^i(0) = 02(0) = 0, 
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the states (f>i , cf ) 2 can be reproduced by the observer 

<t>\ = A c ^i + lu, <£i(0) = 0 

(f) 2 = A c </> 2 - ly, = 0 (5.5.2) 

which implies that 4>i(t) = i = 1,2 Vi > 0. The output of the observer 

is given by 

zo = 0* T rf> 

yo = 6* r j> - A t 0 2 = 6* T (j) - A t ^ 2 (5.5.3) 

The state oj in (5.5.1) can not be reproduced exactly unless the initial 
condition xq and therefore u>o is known. 

Equation (5.5.2) and (5.5.3) describe the nonminimal state observer for 
the plant (5.3.1) when 6* is known. Because cj){t) = (f>(t),\/t > 0, it follows 
that the output observation errors e z = z — zo, eo = y — yo satisfy 

e 2 = e 0 = % = Cj e Act u 0 

which implies that eo,e z decay to zero exponentially fast. The eigenvalues 
of A c can be regarded as the eigenvalues of the observer and can be assigned 
arbitrarily through the design of A c . 

When 6* is unknown, the observer equation (5.5.2) remains the same but 
(5.5.3) becomes 

z = y + A > 2 

y = 6 T 0-\ T j> 2 (5.5.4) 

where 9{t) is the estimate of 9* at time t and (j>(t) = <f>(t) is generated from 
(5.5.2). Because 9* satisfies the parametric model 

z = y + \ t 4>2 = 9* T ct> + 770 (5.5.5) 

where z, (j) are available for measurement and r/o is exponentially decaying 
to zero, the estimate 9{t) of 9* may be generated using (5.5.5) and the 
results of Chapter 4. As shown in Chapter 4, the exponentially decaying to 
zero term 7]q does not affect the properties of the adaptive laws developed 
for r/o = 0 in (5.5.5). Therefore, for design purposes, we can assume that 
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r/o = 0 Vi > 0 and select any one of the adaptive laws from Tables 4.2, 4.3, 
and 4.5 to generate 6(t). In the analysis, we include / 0 and verify that 
its presence does not affect the stability properties and steady state behavior 
of the adaptive observer. As an example, let us use Table 4.2 and choose 
the gradient algorithm 


0 = T e<f), e=z—z=y—y (5.5.6) 

where T = T t > 0 and <f> = <f>. 

Equations (5.5.2), (5.5.4) and (5.5.6) form the adaptive observer and are 
summarized in Table 5.4 and shown in Figure 5.3. 

The stability properties of the adaptive observer in Table 5.4 are given 
by the following theorem. 

Theorem 5.5.1 The adaptive observer for the nonminimal plant represen- 
tation (5.5.1) with the adaptive law based on the gradient algorithm or any 
other adaptive law from Tables f.2, f.3, and f.5 that is based on the para- 
metric model (5.5.5) guarantees that 

(i) cf)(t ) = (f(t) Vt > 0. 

(ii) All signals are u.b. 

(iii) The output observation error e(t) = y{t) — y(t ) converges to zero as 
t — > oo. 

(iv) If u is sufficiently rich of order 2 n, then 9{t ) converges to 9*. The 
convergence of 9(f) to 9* is exponential for all the adaptive laws of Tables 
4-2, 4-3 > an d 4-5 the exception of the pure least squares where 
convergence is asymptotic. 

Proof (i) This proof follows directly from (5.5.1) and (5.5.2). 

(ii) Because <j> = f, the following properties of the adaptive laws can be es- 
tablished using the results of Chapter 4: (a) e, 0 £ T ^ ; (b) e, 9 £ T <*, f) T 2 for 
the continuous time adaptive laws and \0k+i — 9k\ £ b for the hybrid one. From 
e, 9, 6 € Too 5 u £ Too, the stability of A c and the stability of the plant, we have that 
all signals are u.b. 

(iii) For the continuous-time adaptive laws, we have that 

e = z-z = y + \ T (j >2 ~(y + A t <^ 2 ) = y-y 

We can verify that e € Too which together with e € T 2 imply that e(t) — > 0 as 
t — > oo. 
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Table 5.4 Adaptive observer (Realization 1) 


Plant 

4 >i = + lu, </> i(0) =0 

4>2 = A c 02 - ly, 02(0) = 0 

c Jj = A c u, a>( 0 ) = ujq 

V 0 = Cju 

z = y + A t 0 2 = 0* T 4> + Vo 

y = 0* T (j) - \ T (j) 2 + 77 0 

Observer 

<t> 1 =A c (j)i + lu, 0 i(O) = 0 

cj ) 2 = Ac0 2 - ly, ^2(0) = 0 
z = 8 T (j) 
y = z- A t ^2 

Adaptive law 

0 = Tecj), r = r T > 0 

e = z — z, z = 8 T cj) 

Design 

variables 

A c is a stable matrix; (A c , l) is in the controller form; 
A(s) = det(sl — A c ) = s n + A T a n _i(s) 


For the hybrid adaptive law, we express e as 

e = 0* T (j) - 0j(p = -9 k <j), \/t£[tk,tk+ 1 ) 

Because the hybrid adaptive law guarantees that (a) 0 k £ l^, (b) \0 k +i—0 k \ £ l 2 and 
\9k+i — 0k\ > 0 as k — ■> 00 , we can construct a continuous, piecewise linear function 

9(t) from 9 k using linear interpolation that satisfies: (a) \0 — 0 k \, 0 £ Too fl ^2 and 
(b) 1 0{t) — 0k(t) | — > 0 as t — > 00 . Therefore, we can write 

e = -9jcj>=-9 T (j>+(9-0 k ) T (j> (5.5.7) 

From e £ C 2 , \0 — 9 k \ € C 2 and </> £ we have 0 T </> £ C 2 . Because 4i(0 r <j>) = 
0 <[> + 0 T 4> and 0,0, <f>,<j> £ Coo , w e conclude that J^(0 T </>) £ Coo , which, together 
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Figure 5.3 Adaptive observer using nonminimal Realization 1. 


with 9 t < f> £ C- 2 , implies that 9 T (f> — > 0 as t — > oo. Therefore, we have established 
that (9 — 9k) T 4> — > 0 and 9 T <j> — > 0 as t — > oo. Thus, using (5.5.7) for e, we have 
that e(t) — > 0 as t — > oo. 

(iv) The proof of this part follows directly from the properties of the adaptive 
laws developed in Chapter 4 and Theorem 5.2.4 by noticing that </> = H(s)u where 
H(s) is the same as in the proof of Theorem 5.2.4. □ 


The reader can verify that the adaptive observer of Table 5.4 is the state- 
space representation of the parameter identifier given by (5.2.17), (5.2.18) 
in Section 5.2.3. This same identifier is shared by the adaptive Luenberger 
observer indicating that the state (j) of the nonminimal state space repre- 
sentation of the plant is also part of the state of the adaptive Luenberger 
observer. In addition to (j), the adaptive Luenberger observer estimates the 
state of a minimal state space representation of the plant expressed in the 
observer canonical form at the expense of implementing an additional nth 
order state equation referred to as the observer equation in Table 5.1. The 
parameter identifiers of Section 5.2 can, therefore, be viewed as adaptive 
observers based on a nonminimal state representation of the plant. 
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5.5.2 Adaptive Observer Based on Realization 2 

An alternative nonminimal state-space realization for the plant (5.3.1), de- 
veloped in Chapter 2 and shown in Figure 2.5, is described by the state 
equations 

xi = — Ao.xi + 0* T (f>, xi(0) = 0 

fa = A c fa+lu , 0i (0) = 0 

fa = A c <f>2 - ly , 02(0) = 0 (5.5.8) 

$ = K0i”,y,0 2 ] t 
Cj = A c cu, iu(0) = loo 
y o = Cqu 

y = xi + r? 0 

where x\ £ IZ\ fa £ IZ n ~ 1 ,i = 1,2; 9* £ 7?. 2n is a vector of linear 

combinations of the unknown plant parameters a = [a n _i, a n _ 2 , . . . , ao] T > 
b = [6 n _i, b n - 2 , ■ ■ ■ 0o] T i n the plant transfer function (5.3.7) as shown in 
Section 2.4; Ao > 0 is a known scalar; A c £ 7^,( n ^ 1 ) x ( n ^ 1 ) i§ a known stable 
matrix; and (si — A c ) -1 / = , where A*(s) = s n_1 + g n _ 2 S n_2 -|-. . . + <70 

is a known Hurwitz polynomial. When 6* is known, the states xi, fa, fa can 
be generated exactly by the observer 

Xoi = — A 0 x o i + 9* T fa x o i(0) = 0 

fa = A c fa + lu, fa{0) = 0 

fa = A cfa - ly, fa(0 ) = 0 (5.5.9) 

yo = x 0 i 

where x 0 i,fa are the estimates of xifa respectively. As in Section 5.5.1, no 
attempt is made to generate an estimate of the state u, because uj(t) — » 0 
as t — > 00 exponentially fast. The state observer (5.5.9) guarantees that 
x 0 i(t) = xi(t),fa(t) = fa(t),fa(t) = 02 (t) Vt > 0. The observation error 
eo = y — yo satisfies 

e 0 = 770 = Cq e Act uj 0 

i.e. , eo {t) — > 0 as t — > 00 with an exponential rate that depends on the matrix 
A c that is chosen by the designer. 
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When 8* is unknown, (5.5.9) motivates the adaptive observer 
x\ = — Ao&i + 0 T cj), xo(0) = 0 

4>i = Acfa + lu, (j) i(0) = 0 

k = Kfc-Iy, <M0) = o (5.5.10) 

y = x i 

where 8(t) is the estimate of 8* to be generated by an adaptive law and 
x\ is the estimate of x\. The adaptive law for 8 is developed using the 
SPR-Lyapunov design approach as follows: 

We define the observation error y = y — y = x\ + r/o, where x\ = x\ — x\, 
and use it to develop the error equation 


y = -Ao y - 8 T 4> + Cju 


(5.5.11) 


where Cj = AoCq" + Cq A c and 8 = 8 — 8* by using (5.5.8) and (5.5.9), and 
the fact that cj) = <t>. Except for the exponentially decaying to zero term Cj u;, 
equation (5.5.11) is in the appropriate form for applying the SPR-Lyapunov 
design approach. 

In the analysis below, we take care of the exponentially decaying term 
Cj lu by choosing the Lyapunov function candidate 



2 


+ (3lu t P C LU 


where T = T t > 0; P c = Pj > 0 satisfies the Lyapunov equation 


P c A c + AjP c = -I 


and P > 0 is a scalar to be selected. The time derivative V of V along the 
solution of (5.5.11) is given by 

V = -X 0 y 2 -y9 T (/) + yCju + 8 T T- 1 8 - Pu t cj (5.5.12) 


As described earlier, if the adaptive law is chosen as 

8 = 8 = Ty(f> (5.5.13) 

(5.5.12) becomes 

V = — Aq y 2 — /3 lo t lu + yCj u> 
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\Q 1 2 

If (3 is chosen as /3 > then it can be shown by completing the squares 

that 

T > ^ A o ~2 P T 

V < y lo uj 

~ 2 2 

which implies that V,6,y £ Coo and y £ £ 2 - Because from (5.5.11) y £ £00 
it follows that y —* 0 as t — > 00 , i.e., y(t) —> y(t) and x\ (t) — * xi(t) as t —* 00 . 


lT 


Hence, the overall state of the observer X = \3:\ . , 02 J converges to the 

r _ T T i T 

overall plant state X = x\ ,(f>{ ,(f >2 as t — > 00 . In addition 9 £ £2 fl C 0 0 
and lirn^oo 0(f) = 0. 

The convergence of 0(f) to 0* depends on the properties of the input u. 
We can show by following exactly the same steps as in the previous sections 
that if u is sufficiently rich of order 2 n, then <fi = (f> is PE, which, together 
with cj), cj) £ Coo, implies that 9(t ) converges to 9* exponentially fast. For 
4> £ Coo, we require, however, that u £ Coo- 

The main equations of the adaptive observer are summarized in Table 
5.5 and the block diagram of the observer is shown in Figure 5.4 where 
9 = [9j, 9j] T is partitioned into 6\ £ IZ n , M £ H n . 

Example 5.5.1 Consider the LTI plant 

hs + b 0 


y = 


s 2 + ais + ao 


(5.5.14) 


where an, ao > 0 and b\, bo are the unknown parameters. We first obtain the plant 
representation 2 by following the results and approach presented in Chapter 2. 

We choose A(s) = (s + Ao )(s + A) for some Ao, A > 0. It follows from (5.5.14) 
that 


Ms) 


y = [bi,b 0 \ 


1 


Ms) 


u - [ai,a 0 ] 


1 


Ms) 


y 


Because ;= 1 — ( A o+ A ) s + A o A we p ave 
A(s) A(s) 

ai(s) 


V 1 T7W u ~ °2 "TTW y + A 


Ol(s) , tt q i( s ) 


Ms) 


Ms) 


Ms) 


(5.5.15) 


where 9\ = [bi,b 0 ] T ,9% = [a\, ao] T , A = [A 0 + A, AoA] T and ai(s) = [s, 1] T . Because 
A(s) = (s + Aq)(s + A), equation (5.5.15) implies that 


1 


V = 


s + Aq 


q*T 

7 1 


OjXg) 
s H- A 


«l(s) 
72 s + A J 


u _ e *T"n°i y + X T 


Ol(s) 
s A 1 
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Table 5.5 Adaptive observer (Realization 2) 


Plant 

x\ = — AoXi + 9* T (f>, xi( 0 ) = 0 

0i = A c 0i + lu, <f> 1 (0)=0 

4*2 = Kfo - ly , <£ 2 ( 0 ) = 0 

Co = A c oo, a>(0) = loq 

n o = cj uo 

y = X 1 + r /0 

where 0 = [u, 4>J ,y, (f>J] T 

4>i £ 'K n ~ 1 , i = 1 , 2 ; x'i E 710 

Observer 

xi = — Aoxi + 0 T 0 , xi( 0 ) = 0 

0 1 = A c 0i + Zu, 0i(O) — 0 

02 = A c 02 - ly, 02 ( 0 ) = 0 

y = \ 

where 0 = [u, 4>J,y, 4>J ] T 

02 £ TZ n ~ 1 ,i = 1 , 2 , x'i £ 710 

Adaptive law 

0 = Ty 0 , y = y-y 

Design 

variables 

T = r T > 0 ; A c £ 7 £( n- 1 ) x ( n ~i) is any stable matrix, 
and Aq > 0 is any scalar 


Substituting for 


«i(s) 1 

s 


■ i ' 

1 

1 

' -A ' 

s A s H- A 

1 


0 

s + A 

1 


we obtain 


1 


V = 


s + Aq 


b 1 u+ (b 0 - A&i) 1 u - ax y - (a 0 - Xax) 1 y 
s + A s + A 

1 


+ (A 0 + A )y - Xr — — y 
s + A 
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Figure 5.4 Adaptive observer using nonminimal Realization 2. 
which implies that 

X\ = — Ao^i + 9* T cf>, 5 1 (0) = 0 
</> i = -Xtpi+u, </>i(0)=0 
<t> 2 = -Xfc-y, 4> 2(0) = 0 

y = x 1 

where 4> = [u, <f>i, y, (f> 2 \ T , 9* = [61,60 ~ A6i,Ao + A — ai,ao — Aai + A 2 ] T . Using 
Table 5.5, the adaptive observer for estimating Xi,4>\, (f > 2 and 0* is given by 

x\ = — A0X1 + 9 T cf), xi(0) = 0 

(j>\ = -A^i + u, ^i(0)= 0 

$2 = A0 2 - y, <£2(0) = 0 

y = x 1 

6 = Y(j>(y-y) 

where <t> = [u, (f>i,y, fa} 1 and T = T t > 0. If in addition to 9*, we like to estimate 
9* = [61, 60, ai, ao] T , we use the relationships 

61 = h 

60 = 02 + A01 

ai = —03 + Ao + A 

a 0 = 04 — A 03 + AA 0 
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where 9 t , i = 1,2, 3, 4 are the elements of 9 and bi,di,i = 1,2 are the estimates of 
bi,a,i,i = 0,1, respectively. 

For parameter convergence we choose 

u = 6 sin 2.67 + 8 sin 4.27 

which is sufficiently rich of order 4. □ 


5.6 Parameter Convergence Proofs 

In this section we present all the lengthy proofs of theorems dealing with convergence 
of the estimated parameters. 

5.6.1 Useful Lemmas 

The following lemmas are used in the proofs of several theorems to follow: 

Lemma 5.6.1 If the autocovariance of a function x : 1Z + i— > 77" defined as 

A 1 po+T 

Rx(t) = lim — / x(t)x t (t + r)dr (5.6.1) 

T^oo 1 J to 

exists and is uniform with respect to to, then x is PE if and only if R x ( 0) is positive 
definite. 

Proof 

If: The definition of the autocovariance i?a;(0) implies that there exists a To > 0 
such that 

^ ^ pto+To o 

2^(0) <Y o J' x(t)x t (t) dr < -R x ( 0), Vf > 0 

If R x ( 0) is positive definite, there exist 01,02 > 0 such that a±I < R x ( 0) < 02 1. 
Therefore, 

^ ^ /*to+To ^ 

—cx\I < — x(r)x T (r)dT < -a 2 I 

1 1 0 Jto 1 

for all 1 0 > 0 and thus x is PE. 

Only if: If x is PE, then there exist constants oo,Ti >0 such that 

t+Ti 

x(t)x t ( r)dr > OqTi/ 
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for all t > 0. For any T > T), we can write 

rto+T k - 1 ,t 0 +(*+ 1)T X /-io+T 

/ s(r)a; T (r)dT = / a’(r)a: T (T)fir + / a;(r)a; T (r)(iT 

Jto o •'io+^i */to+fcT'i 

> ka^TiI 

where k is the largest integer that satisfies fc < T/T\, i.e., fcTi < T < (k + l)Ti. 
Therefore, we have 


/ t+i kTi 

x(t)x t (r)dr > —a 0 I 


For fc > 2, we have = l ' k+ ^ Tl — S- > 1 — & > b, thus, 


A* ^0 ~ I - 

; / x(r)x T (r)(ir > 

J to 2 


2 /’to+T 1 

f? a ,(0) = lim — / a;(r)a; T (r)d- 

T ^°° 1 J tn 


, ^ Cfi) r 

T > i 

“ 2 


which implies that i?a;(0) is positive definite. 


Lemma 5.6.2 Consider the system 


y = H(s)u 

where H (s) is a strictly proper transfer function matrix of dimension m x n with 
stable poles and real impidse response h(t). If u is stationary, with autocovariance 
R u (t), then y is stationary, with autocovariance 

/ OO pOO 

/ h[r\)R u (t + Ti - r 2 )h T (T 2 )dT 1 dT 2 
-oo J —oo 


and spectral distribution 


Proof See [201]. 


S v (u>) = H(-jiv)S u (w)H T (jw) 


Lemma 5.6.3 Consider the system described by 


x\ _ A — F T (t) Xi 

x 2 P 1 F(t)P 2 0 x 2 


(5.6.2) 
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where x\ € 1Z ni ,X2 G lZ rni for some integer r,ni > 1, A,P±,P 2 are constant 
matrices and F(t) is of the form 


m = 


z\In 1 

Ini 


Zrln± 


G lZ rni * Ul 


where Zi,i = 1,2, ... ,r are the elements of the vector z G lZ r . Suppose that z is PE 
and there exists a matrix Pq > 0 such that 


Pq + Aq Pq + PqAq + CqC < 0 


(5.6.3) 


where 

, Cj = [I ni ,0] 

Then the equilibrium x\ e = 0,X2 e = 0 of (5.6.2) is e.s. in the large. 


A —F T (t) 
PiF(t)P 2 0 


Proof Consider the system (5.6.2) that we express as 


x = Ao(t)x 
y = Cq x = xi 


(5.6.4) 


where x = [xl,xl] T . We first show that (Co,A 0 ) is UCO by establishing that 
(Co, Aq + KCq ) is UCO for some K G £oo which according to Lemma 4.8.1 implies 
that (Co, Aq) is UCO. We choose 


~7 Pi - A 
-P 1 F(t)P 2 


for some 7 > 0 and consider the following system associated with (C 0 , A 0 + KCq ): 



-him 

0 


2/1 = [In 1 




Ul 

U 2 


(5.6.5) 


According to Lemma 4.8.4, the system (5.6.5) is UCO if 


FfP) 


1 

s + 7 


F(t) 


1 r t+T 

al rn 1 < — / F f (r)Fj (r )dr < (3I rni , Vf > 0 


satisfies 


(5.6.6) 
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for some constants a,/3,T > 0. We prove (5.6.6) by first showing that F(t) satisfies 

al rni <^j F{t)F t {t)(It < (3 I rni , Vt > 0 

for some constants a , fi as follows: Using a linear transformation, we can express 
F(t) as 

F(t) = F 0 Z(t) 

where F 0 £ K™ 1 xrrV| j s a constant matrix of full rank, Z £ -j^rnixm j g a block 
diagonal matrix defined as Z = diag{^, z , . . . , z}, i.e., 

0 ' 

0 


0 

0 


0 


Zl 


ZZ T = diag {zz T ,zz T , . . . , zz T j 


Because z is PE, we have 

1 r t+T 

ot\I r — rp I zz T dr < a 2 I r , Vt > 0 

T Jt 

for some a\,a 2 ,T > 0. Therefore, 

1 r*+ T 

atilrm < j, J Z(t )Z t (t)cIt < a 2 I rni , Vt > 0 

which implies that 

1 r t + T 

a x F 0 Fq < — / F(t )F t (t)cLt < cx 2 FoFq , Vt > 0 


Z = 


Zi 0 

z 2 0 


0 z\ 


0 Zr, 


0 0 


0 0 


Therefore, FF T = F 0 ZZ t Fq and 
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Because Fo is of full rank, we have 

PlTni < Fo F q < /? 2 I r m 

for some constants j3\ , (h > 0. Hence, 

/ 1 / 
a Em < ^ J F(t)F t ( r)dT < f3 I rni , Vi > 0 (5.6.7) 

where /? = 02/52, ct = a±0i. 

Following the same arguments used in proving Lemma 4.8.3 (iv), one can show 
(see Problem 5.18) that (5.6.7) implies (5.6.6). 

Because all the conditions in Lemma 4.8.4 are satisfied, we conclude, by apply- 
ing Lemma 4.8.4 that (5.6.5) is UCO, which in turn implies that (5.6.2) is UCO. 
Therefore, it follows directly from Theorem 3.4.8 and (5.6.3) that the equilibrium 
X\ e = 0, X2e = 0 of (5.6.2) is e.s. in the large. □ 

5.6.2 Proof of Theorem 5.2.1 

According to Lemma 5.6.1, Theorem 5.2.1 can be proved if we establish that f?^( 0) 
is positive definite if and only if u is sufficiently rich of order n. 

If: We will show the result by contradiction. Because u is stationary and R$(Q) 
is uniform with respect to t, we take t = 0 and obtain [186] 

1 r T i 

R<t>( 0) = lim — / 4>(t)4> t (t) dr = — / S,j,(u)dw (5.6.8) 

T—>oo 1 J 0 2n J_ OQ 

where S^w) is the spectral distribution of <f>. From Lemma 5.6.2, we have 

S<j>(uj) = H(-ju>)S u (u>)H T (jw) (5.6.9) 

Using the condition that u is sufficiently rich of order n. i.e., u has spectral lines at 
n points, we can express S u (u>) as 

n 

S u (co) = ^ fu(ui)5(u> - u>i) (5.6.10) 

i—i 

where /„(o;,) > 0. Using (5.6.9) and (5.6.10) in (5.6.8), we obtain 

1 n 

R <t>(6) = — 'Y^fu(u i )H(-ju} i )H T (ju i ) 

71 i— 1 

Suppose that J?^( 0) is not positive definite, then there exists x £ lZ n with x A 0 
such that 

n 

x t R, i> {0)x = ^ f u (ui)x T H(-juji)H r {jui)x = 0 

i= 1 


(5.6.11) 
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Because f u (x>i) > 0 and each term under the summation is nonnegative, (5.6.11) 
can be true only if: 


x T H(—jLOi)H T (ju>i)x = 0, i = 1, 2, . . . , n 

or equivalently 

x l H{-jLo i ) = 0, i = 1, 2, . . . , n (5.6.12) 

However, (5.6.12) implies that {H(jwi), i?(jw 2 )i • • • , H(juj n )} are linearly depen- 
dent, which contradicts with the condition that . . . H(ju> n ) are linearly 

independent for all u)\, . . . , w„. Hence, f?^( 0) is positive definite. 

Only if: We also prove this by contradiction. Assume that R^( 0) is positive 
definite but u is sufficiently rich of order r < n, then we can express f?^( 0) as 

R*( °) = ^ '52Mu i ) H (-M) HT (-M) 

2=1 

where f u (wi) > 0- Note that the right hand side is the sum of r — dyads , and the 
rank of i?^( 0) can be at most r < n, which contradicts with the assumption that 
i?^( 0) is positive definite. □ 


5.6.3 Proof of Theorem 5.2.2 

We first consider the series-parallel scheme (5.2.11). From (5.2.10), (5.2.11) we 
obtain the error equations 


i\ = A m e i — Bu — Ax 
A = jeix T , B = jeiu 


(5.6.13) 


where A = A — A 1 B = B — B and B £ lZ nxl , A £ A nxn . For simplicity, let us take 
7i = 72 = 7- The parameter error A is in the matrix form, which we rewrite in the 
familiar vector form to apply the stability theorems of Chapter 3 directly. Defining 
the vector 

e = [dJ,dJ,...,dl,B T ] T £TlAn+l) 
where di is the itli column of A , we can write 


Ax + Bu = [d\, 0 , 2 , ■ ■ ■ , a n ] 


Xl 

X 2 


X 


n 


+ Bu=F r (t)e 
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where P T (t) = [aq/„, x-i I n , ■ ■ ■ , x n I n , uI n )£lZ nxn O +1 ) . Because ai = ^e\Xi, B = ^e\u, 
the matrix differential equations for A, B can be rewritten as 

9 = 'yF(t)e 1 


Therefore, (5.6.13) is equivalent to 


ei — A m e i — F T (t)8 

8 = jF(t)ei 


(5.6.14) 


which is in the form of (5.6.2). To apply Lemma 5.6.3, we need to verify that all 
the conditions stated in the lemma are satisfied by (5.6.14). 

We first prove that there exists a constant matrix P 0 > 0 such that 

Aq Pq + PqAq = —CqC'q 


where 
A 0 = 


A -F T (t) 
7.F(i) 0 


j^(n+n(n+l))x (n+n(n+l)) 0] G 7^ n +"( ra + 1 ) 


In Section 4.2.3, we have shown that the time derivative of the Lyapunov function 


f A T PA 1 

l 71 / 


V = el Pei + tr 
(with 71 = 72 = 7) satisfies 
where P satisfies A^P + PA m = —I n . Note that 


tr 


b t pb 

72 


V = -ejei 


tr 

tr 


{a t pa} = J2~ a J p ~ a < 

i= 1 
►T ; 


|p t pp| 


= B' PB 


(5.6.15) 


where the second equality is true because B € 1Z nxl . We can write 

1 n ] 

V = eJ Pei H aj Pat H B T PB = x T PqX 

7i 72 

where x = [e^ , 0 T ] T and Pq is a block diagonal matrix defined as 

P 0 = diag {P, 7 f X P, . . . ,7^,72 X P} G ^ (n+n(n+1))x(n+n(n+1)) 


n-times 
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Hence, (5.6.15) implies that 

V = x T (P 0 A 0 + Aq P 0 )x = -x T C 0 Cjx 


or equivalently 

Pq + PqAq + A iJ" Pq = —C'oCj , Pq — 0 

Next, we show that z = [x\, X 2 , ■ ■ ■ , x n , tt] T is PE. We write 


z = H(s)u, H(s) 


(si — A)~ 1 B 
1 


(5.6.16) 


If we can show that H(jw i), H(ju 2 ), . . . , H(ju) n+ i) are linearly independent for 
any u>i, iv 2 , ■ ■ ■ , w n +i, then it follows immediately from Theorem 5.2.1 that z is PE 
if and only if u is sufficiently rich of order n + 1. 

Let a(s ) = det (si — A) = s n + a„_is n_1 + . . . + aqs + ao- We can verify using 
matrix manipulations that the matrix (si — A)^ 1 can be expressed as 

(si -A)- 1 = -^-{ls n - 1 + (A + a n - 1 I)s n - 2 + (A 2 +a n - 1 A + a n - 2 I)s n - 3 

a(s) 

+ . . . + (A 71 - 1 + a n ^A n - 2 + . . . + ai /)} (5.6.17) 

Defining 

bi=B, b 2 = (A + a n -\I)B , b^ = (A"-\-a n -iA-\-a n - 2 I)B, . . . ,b n = (A n 1 +... + ai I)B 

l= [h,b2,...,b n \ &n nxn 

and then using (5.6.17), H(s ) can be conveniently expressed as 



- g n - 1 - 



S n ~ 2 


L 

s 



1 



a(s) 



1 

«(s) 


(5.6.18) 


To explore the linear dependency of H(ju>i), we define H = [H(ju>i), H(ju> 2 ), ■ ■ 
H(ju n +i)]. Using the expression (5.6.18) for H(s), we have 


H 


(M)"- 1 (M) n_1 

( m )"- 2 

M JU 2 

l l 

a(M) a(M) 


(M+i)" 1 
(M+i ) n_2 


ju-n+l 

1 

a(M+i) 
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0 
0 

0 

i 

a(jUJn + l) 

From the assumption that ( A , B) is controllable, we conclude that the matrix L 
is of full rank. Thus the matrix H has rank of n + 1 if and only if the matrix V\ 
defined as 

(MM 1 (MM 1 ••• CM+iM 1 

cmm 2 cmm 2 ... (ju) n+ i) n ~ 2 

ju 1 M ■ ■ ■ M+ 1 
1 1 ••• 1 
a(M) a(jw 2 ) ••• a(ju n+1 ) 

has rank of n + 1. Using linear transformations (row operations), we can show that 
V\ is equivalent to the following Vandermonde matrix [62]: 

CM)" CM)” ••• (M+l)" 

(mm 1 cmm 1 ••• (M+i)”- 1 

(MM 2 (MM 2 ••• (M+l)”- 2 


ju i M • • • M+i 

l l ••• l 

Because 

det(V) = (M - M) 

l<i<k<n-\-l 

V is of full rank for any u>i with A b k = 1, . . . n + 1. This leads to 
the conclusion that V) and, therefore, // have rank of n + 1, which implies that 
H{ju>2), ■ ■ ■ ,i?(M+l) are linearly independent for any w 1 ,+>2, • • •+’n+i- It 
then follows immediately from Theorem 5.2.1 that z is PE. 

Because we have shown that all the conditions of Lemma 5.6.3 are satisfied by 
(5.6.14), we can conclude that x e = 0 of (5.6.14) is e.s. in the large, i.e. , a, 9 — » 0 
exponentially fast as t — > oo. Thus, A — > A, B — > U exponentially fast as t — > oo, 
and the proof of Theorem 5.2.2 for the series-parallel scheme is complete. 

For the parallel scheme, we only need to establish that z = [a: T ,u] T is PE. The 
rest of the proof follows by using exactly the same arguments and procedure as in 
the case of the series-parallel scheme. 

Because x is the state of the plant and it is independent of the identification 
scheme, it follows from the previous analysis that z is PE under the conditions given 
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in Theorem 5.2.2, i.e. , ( A , B ) controllable and u sufficiently rich of order n+ 1. From 
the definition of ei, we have 


x = x — ei, z = z — 


Jnx 1 


l 


ei 


thus the PE property of z follows immediately from Lemma 4.8.3 by using € C 2 
and ^ being PE. □ 


5.6.4 Proof of Theorem 5.2.3 

We consider the proof for the series-parallel scheme. The proof for the parallel 
scheme follows by using the same arguments used in Section 5.6.3 in the proof of 
Theorem 5.2.2 for the parallel scheme. 

Following the same procedure used in proving Theorem 5.2.2, we can write the 
differential equations 

( i = A m e i + (A — A)x + (13 — B)u 
A = 'yeix 7 
B = 7eitt T 

in the vector form as 

h = A m e 1 -F r (t)9 

6 = F(t)e i (5.6.19) 

where 9 = [aj ,aj , . . . ,a T ,bj ,bj , . . . ,bV] T and denotes the ith column of 

A, B, respectively, F T (t) = [x\ x 2 I n , • ■ • , x n I n , Uil n , u 2 I n , ■■■ , u q I n \. Following 
exactly the same arguments used in the proof for Theorem 5.2.2, we complete the 

proof by showing (i) there exists a matrix P 0 > 0 such that A J" Po + Po^4o = —Co Cj , 

where Ao, Co are defined the same way as in Section 5.6.3 and (ii) 2 = [x\, x 2 , ■ ■ ., 
x n , Mi, u 2 , . . ., u q ] T is PE. 

The proof for (i) is the same as that in Section 5.6.3. We prove (ii) by showing 
that the autocovariance of z, R z ( 0) is positive definite as follows: We express 2 as 


2 = 


{sI-A^B 

h 


« = £ 


{si-AyHi 


where I q € 1Z qxq is the identity matrix and 6,; , e, denote the ith column of B,I q , 
respectively. Assuming that u-i , i = 1 are stationary and uncorrelated, the 
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autocovariance of z can be calculated as 

9 


-j y /> OO 

R z (°) = ^ J H i(-M s Ui {u)H7 (juj) 


where 


Hi(s) = 


{ sI -A)-\ 

e,; 


and S Ui (u>) is the spectral distribution of u, . Using the assumption that u, is 
sufficiently rich of order n + 1, we have 


n+1 


fui(uik)8(uj - u ik ) 


k = 1 


and 


n+1 


nOO 1 - i - 

^ J Hi(-ju>)S Ui {u)Hj (, juj ) = — ^ /«< (uik)Hi(-ju}ik)H? (jwik) 
where f Ui (wik) > 0- Therefore, 


fc=i 


j?,(o) = fui{ui k )Hi(-ju ik )Hj (jw ik ) 


q n+1 


(5.6.20) 


2n 

i — 1 /c— 1 

Let us now consider the solution of the quadratic equation 

x t R z ( 0)s = 0, xG77"+« 

Because each term under the summation of the right-hand side of (5.6.20) is semi- 
positive definite, x T i? z (0)a; = 0 is true if and only if 


f Ui (uik)x T Hi(-ju ik )Hj (ju ik )x = 0, k = i 2 " n + 1 


or equivalently 


Hj (juik)x = 0, 


i = l,2,...q 
k = 1, 2, . . . n + 1 


(5.6.21) 


Because Hi(s) 
alent to: 


i [ adj {al - A)bi 
a ( s ) a(s)e* 


where a(s) = det(sI-A), (5.6.21) is equiv- 


Hj (Mk)x = 0, 


Hi(s) = a(s)Hi(s), 


i = 1,2, ...q 
k = 1, 2, . . . n + 1 


(5.6.22) 
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Noting that each element in Hi is a polynomial of order at most equal to n, we 
find that gi(s) = Hj (s)x is a polynomial of order at most equal to n. Therefore, 
(5.6.22) implies that the polynomial gi{s) vanishes at n + 1 points, which, in turn, 
implies that gi(s) = 0 for all s G C. Thus, we have 

Hj(s)x = 0, * = 1,2,... 5 (5.6.23) 


for all s. Equation (5.6.23) can be written in the matrix form 

|T 


adj {si — A) B 
a(s)I q 


x = 0„ 


(5.6.24) 


where 0 ? G 1Z q is a column vector with all elements equal to zero. Let X = 
n n , Y = [x n+1 , . . . , x n+q ] G n q , i.e., X = [X T , Y T ] T . Then (5.6.24) 
can be expressed as 


(adj (si - A)B) T X + a(s)Y = 0, (5.6.25) 

Consider the following expressions for adj(s/ — A)B and a(s): 

adj (aI-A)B = Bs^A (AB + a n _ 1 B)s n ~ 2 + (A 2 B + a^AB + a n _ 2 B)s n ~ 3 
+ . . . + (A n ~ x B + a n _iA n - 2 S + . . . + oi B) 

u(s) = S U CL n — \S n “(“... T UlS CLq 


and equating the coefficients of s‘ l on both sides of equation (5.6.25), we find that 
X , Y must satisfy the following algebraic equations: 


' Y = 0 

B T X + a n „ 1 Y = 0 q 
, ( AB + a n -iB) T X + a n - 2 Y = 0 9 

( A n ~ 1 + a n - 1 A n ~ 2 B + . . . + oi B) T X + a 0 Y = 0, 

or equivalently: 

Y = 0 q and (B,AB,...,A n - 1 S) T X = 0 nq (5.6.26) 

where 0 nq is a zero column-vector of dimension nq. Because {A 1 B) is controllable, 
the matrix (B, AB , . . . A n ~ 1 B) is of full rank; therefore, (5.6.26) is true if and only 
if X = 0, Y = 0, i.e. x = 0. 

Thus, we have proved that x T R z (0)x = 0 if and only if x = 0, which implies 
that R z (0) is positive definite. Then it follows from Lemma 5.6.1 that z is PE. 

Using exactly the same arguments as used in proving Theorem 5.2.2, we con- 
clude from ^ being PE that ei(f), 9(t) converge to zero exponentially fast as t — > oo. 
From the definition of 9 we have that A(t) — > A, B(t) — > B exponentially fast as 
t — > oo and the proof is complete. □ 
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5.6.5 Proof of Theorem 5.2.5 


Let us define 


m = e T (t)R^o)m 


where 6(t) = 9(t) — 9*. We will show that e(t) — > 0 as t — > oo, i.e., for any given 
e > 0 , there exists a t\ > 0 such that for all t>t\, e(t) < e . 

We express e as 

e(t) = 9 r (t / 4 > { T ) ( I )T ( T )dr9(t) 


+9 T (t) i?^( 0) - 


0(r)^ T (r)(ir 0(f) 


fti+ T _ 1 fti+T _ _ 

; j (0 T ( T )(j)( T )) 2 d T + — j {(9 T (mr)r - (# t ( t )</>( t )) 2 } cLt 


+9 1 (t) fy(0) - 


— ei(t) + e 2 (f) + 63 (f) 


0(r)0 T (r)(ir 0(f) 


where t\ . T are arbitrary at this point and will be specified later. We evaluate each 
term on the right-hand side of the above equation separately. 

Because e(f) = 0 T (f)</>(f) — > 0 as t — > oo, there exists a t 1 > 0 such that 

|e(f)| < for t > f). Choosing t\ > t\, we have 


i rti+T 

\ei(t)\ = r j t e 2 (r)d 


v< rs 


(5.6.27) 


For e 2 (f), we have 


£2 (t) = 


(9(t) - 9 (t)) t (/)(t)(I) T ( r)(9(t) + 0(t))0t 


- / 9{a)da ) 0(r)(/> T (r)(0(t) + 9(t))cIt 


Using the property of the adaptive law that 9 — > 0 as f — > oo, for any given 
T, e we can find a t 2 > 0 such that for all t > t 2 , \9{t)\ < 2 ^ T , where K = 
sup t r |^(r)</> T (r)(0(t) + 0(r))| is a finite constant because of the fact that both 9 
and <fi are uniformly bounded signals. Choosing t\ > t 2 , we have 


|ea(t)| < f 


1 r t i+ T 2e (t — t) i , / x , T / 


3/Cr ( r )( 6 '( i ) + 6l ( T ))l dr < ^ (5.6.28) 
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for all t>t 2 . 

Because the autocovariance R^( 0) exists, there exists T 0 > 0 such that for all 
T>T 0 , 

R<P( 0 ) ~ f £ <j>(T)<j) T (T)d,T 

where K = sup 4 \9(t)\ 2 is a finite constant. Therefore, 

|es| < | (5.6.29) 

Combining (5.6.27) to (5.6.29), we have that for t > max{t 1 ,t 2 } = t±, T > T 0 , 

e(t) = e T {t)R^0)0(t) < e 

which, in turn, implies that e; therefore, i?^(O)0 converges to zero as t —* oo. □ 



5.7 Problems 


5.1 Show that 


^4sin(t + ip) 
sin(f) 


where A, ip are nonzero constants is PE in IT 2 . 


5.2 Consider the following plant 

bis 

9 = i i i\2 u 

( s + 1 y 

where foi is the only unknown parameter. Is u = Cq (constant ) ^ 0 sufficiently 
rich for identifying &i? Explain. Design a parameter identifier to identify b\ 
from the measurements of u, y. Simulate your scheme on a digital computer 
for bi = 5. 


5.3 Consider the plant 

b 2 s 2 + b 0 

V= Js^W U 

where &2 , are the only unknown parameters. Is u = sin t sufficiently rich 
for identifying &2,5o where 62,^0 can be any number in 1Z. Explain. Design 
a parameter identifier to identify b 2 ,bo- Simulate your scheme for a) b 2 = 
1 , 6 q = 1 and b) &2 = 3, bo = 5. 
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5.4 Consider the second order stable plant 



where ai,ag,bi are the only unknown parameters and u € £oo- 

(a) Design a parameter identifier to estimate the unknown parameters. 

(b) Choose u with the least number of frequencies that guarantees param- 
eter convergence. 

(c) Simulate your scheme for a\ = —2, ag = —5, b\ = 8. 


5.5 Simulate the series-parallel identifier in Example 5.2.2. Repeat Example 5.2.2 

by designing and simulating a parallel identifier. In simulations use numerical 
values for A, B of your choice. 

5.6 Perform the simulations requested in (i) and (ii) of Example 5.2.3 when bg = 

— 2,ai = 2.8, ag = 5.6. Comment on your results. 

5.7 Repeat Problem 5.6 when the pure least-squares algorithm in Example 5.2.3 is 

replaced with the least-squares with covariance resetting algorithm. 

5.8 Repeat Problem 5.6 when the pure least-squares algorithm in Example 5.2.2 is 

replaced with the 

(a) Integral algorithm 

(b) Hybrid adaptive law. 

5.9 Design an adaptive Luenberger observer for the plant 


—a i 1 
x = „ 

-ag 0 

V = [1,0] a; 


x + 


bi 

1 


u 


where a ± , ag > 0 and bi ^ 0 are the only unknown parameters using the 
following adaptive laws for on-line estimation: 

(a) Integral algorithm 

(b) Pure least-squares 

(c) Hybrid adaptive law 

In each case present the complete stability proof. Simulate the adaptive 
observers with inputs u of your choice. For simulation purposes assume that 
the unknown parameters have the following values: a\ = 2.5, ag = 3.6, b± = 4. 

5.10 Repeat Problem 5.9 by designing an adaptive observer with auxiliary input. 

5.11 Consider the LTI plant 

bg 

y = u 

s A + ais + a 2 
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where bo yf 0, and ai,a 2 > 0. Represent the plant in the following forms: 

(a) Observable form 

(b) Nonminimal Realization 1 

(c) Nonminimal Realization 2 

5.12 Design an adaptive observer using an integral adaptive law for the plant of 
Problem 5.11 represented in the observer form. 

5.13 Repeat Problem 5.12 for the same plant in the nonminimal Realization 1. 

5.14 Design an adaptive observer for the plant of Problem 5.11 expressed in the 
nonminimal Realization 2. 

5.15 Consider the following plant: 

V = W 0 (s)G(s)u 

where Wo(s) is a known proper transfer function with stable poles and G(s) 
is a strictly proper transfer function of order n with stable poles but unknown 
coefficients. 


(a) Design a parameter identifier to identify the coefficients of G(s). 

(b) Design an adaptive observer to estimate the states of a minimal realiza- 

tion of the plant. 

5.16 Prove that there exists a signal vector v G 7 Z n available for measurement for 
which the system given by (5.4.2) becomes 

y = -d T (j) 


where (f> = H(s) 


u 

y 


G lZ 2n and H(s) is a known transfer matrix. 


5.17 Use the result of Problem 5.16 to develop an adaptive observer with auxiliary 
input that employs a least-squares algorithm as a parameter estimator. 

5.18 Let F(t) : 1Z i— > 7 Z nxm and F,Fg C 0 c . If there exist positive constants 
k -[ , k -2 ■ Tq such that 


i rt+To 

kiln <7f F(t)F t (t)cLt < k 2 In 

lo Jt 

for any t > 0, where I n is the identity matrix of dimension n, show that if 


Ff = 


-F 


with a > 0, b yf 0, then there exist positive constants k 1 ,k 2 ,T 0 such that 


kil n < „/ 

J- n 


rt+To 


o Jt 


Ff(r)Fj (r)dr < k 2 I n 


for any t > 0. 



Chapter 6 


Model Reference Adaptive 
Control 

6.1 Introduction 

Model reference adaptive control (MRAC) is one of the main approaches 
to adaptive control. The basic structure of a MRAC scheme is shown in 
Figure 6.1. The reference model is chosen to generate the desired trajectory, 
y m , that the plant output y p has to follow. The tracking error e\ = y v — y m 
represents the deviation of the plant output from the desired trajectory. 
The closed-loop plant is made up of an ordinary feedback control law that 
contains the plant and a controller C(0 ) and an adjustment mechanism that 
generates the controller parameter estimates 0(t ) on-line. 

The purpose of this chapter is to design the controller and parameter ad- 
justment mechanism so that all signals in the closed-loop plant are bounded 
and the plant output y p tracks y m as close as possible. 

MRAC schemes can be characterized as direct or indirect and with nor- 
malized or unnormalized adaptive laws. In direct MRAC, the parameter vec- 
tor 6 of the controller C{9) is updated directly by an adaptive law, whereas 
in indirect MRAC 6 is calculated at each time t by solving a certain algebraic 
equation that relates 9 with the on-line estimates of the plant parameters. 
In both direct and indirect MRAC with normalized adaptive laws, the form 
of C(6), motivated from the known parameter case, is kept unchanged. The 
controller C{6) is combined with an adaptive law (or an adaptive law and 
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Figure 6.1 General structure of MRAC scheme. 


an algebraic equation in the indirect case) that is developed independently 
by following the techniques of Chapter 4. This design procedure allows the 
use of a wide class of adaptive laws that includes gradient, least-squares and 
those based on the SPR-Lyapunov design approach. On the other hand, 
in the case of MRAC schemes with unnornralized adaptive laws, C(9) is 
modified to lead to an error equation whose form allows the use of the SPR- 
Lyapunov design approach for generating the adaptive law. In this case, 
the design of C{9) and adaptive law is more complicated in both the di- 
rect and indirect case, but the analysis is much simpler and follows from a 
consideration of a single Lyapunov-like function. 

The chapter is organized as follows: In Section 6.2, we use several exam- 
ples to illustrate the design and analysis of a class of simple direct MRAC 
schemes with unnornralized adaptive laws. These examples are used to mo- 
tivate the more general and complicated designs treated in the rest of the 
chapter. In Section 6.3 we define the model reference control (MRC) prob- 
lem for SISO plants and solve it for the case of known plant parameters. 
The control law developed in this section is used in the rest of the chapter 
to form MRAC schemes in the unknown parameter case. 

The design of direct MRAC schemes with unnornralized adaptive laws is 
treated in Section 6.4 for plants with relative degree n* = 1, 2, 3. The case of 
n* > 3 follows by using the same techniques as in the case of n* = 3 and is 
omitted because of the complexity of the control law that increases with n* . 
In Section 6.5 we consider the design and analysis of a wide class of direct 
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MRAC schemes with normalized adaptive laws for plants with arbitrary but 
known relative degree. 

The design of indirect MRAC with unnornralized and normalized adap- 
tive laws is considered in Section 6.6. In Section 6.7, we briefly summarize 
some efforts and alternative approaches to relax some of the basic assump- 
tions used in MRAC that include the minimum phase, known relative degree 
and upper bound on the order of the plant. In Section 6.8, we present all 
the long and more complicated proofs of theorems and lemmas. 

6.2 Simple Direct MRAC Schemes 

In this section, we use several examples to illustrate the design and analysis 
of some simple direct MRAC schemes with unnornralized adaptive laws. We 
concentrate on the SPR-Lyapunov approach for designing the adaptive laws. 
This approach dominated the literature of adaptive control for continuous- 
time plants with relative degree n* = 1 because of the simplicity of design 
and stability analysis [48, 85, 172, 201]. 

6.2.1 Scalar Example: Adaptive Regulation 

Consider the following scalar plant: 

x = ax + u, x'(0) = xq (6.2.1) 

where a is a constant but unknown. The control objective is to determine 
a bounded function u = f(t,x) such that the state x(t) is bounded and 
converges to zero as t — ► oo for any given initial condition xq. Let —a m be 
the desired closed-loop pole where a m > 0 is chosen by the designer. 

Control Law If the plant parameter a is known, the control law 

u = -k*x (6.2.2) 

with k* = a + a m could be used to meet the control objective, i.e., with 
(6.2.2), the closed- loop plant is 


x = —a m x 
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whose equilibrium x e = 0 is e.s. in the large. 

Because a is unknown, k* cannot be calculated and, therefore, (6.2.2) 
cannot be implemented. A possible procedure to follow in the unknown 
parameter case is to use the same control law as given in (6.2.2) but with k* 
replaced by its estimate k(t), i.e., we use 

u = —k(t)x (6.2.3) 

and search for an adaptive law to update k(t) continuously with time. 

Adaptive Law The adaptive law for generating k(t) is developed by view- 
ing the problem as an on-line identification problem for k* . This is ac- 
complished by first obtaining an appropriate parameterization for the plant 
(6.2.1) in terms of the unknown k* and then using a similar approach as in 
Chapter 4 to estimate k* on-line. We illustrate this procedure below. 

We add and subtract the desired control input — k*x in the plant equation 
to obtain 

x = ax — k* x + k* x + u. 

Because a — k* = —a m we have 

x = —a m x + k*x + u 


or 

x = — - — ( u + k*x ) (6.2.4) 

s + a m 

Equation (6.2.4) is a parameterization of the plant equation (6.2.1) in terms 
of the unknown controller parameter k* . Because x,u are measured and 
a m > 0 is known, a wide class of adaptive laws may be generated by simply 
using Tables 4.1 to 4.3 of Chapter 4. It turns out that the adaptive laws 
developed for (6.2.4) using the SPR-Lyapunov design approach without nor- 
malization simplify the stability analysis of the resulting closed-loop adaptive 
control scheme considerably. Therefore, as a starting point, we concentrate 
on the simple case and deal with the more general case that involves a wide 
class of adaptive laws in later sections. 

Because — ( — is SPR we can proceed with the SPR-Lyapunov design 
approach of Chapter 4 and generate the estimate x of x as 


x = 


[kx + u] 


1 


( 0 ) 


s + a m 


s + a m 


(6.2.5) 



6.2. SIMPLE DIRECT MRAC SCHEMES 


317 


where the last equality is obtained by substituting the control law u = —kx. 
If we now choose x(0) = 0, we have x[t) = 0 , Vt > 0, which implies that the 
estimation error ei defined as e± = x — x is equal to the regulation error, 
i.e., €\ = x , so that (6.2.5) does not have to be implemented to generate 
x. Substituting for the control u = —k[t)x in (6.2.4), we obtain the error 
equation that relates the parameter error k = k — k* with the estimation 
error e± = x , i.e., 

ei = — a m ei — kx, e\ = x ( 6 . 2 . 6 ) 


or 

ei = (—kx) 

s + a m ' ' 

As demonstrated in Chapter 4, the error equation (6.2.6) is in a convenient 
form for choosing an appropriate Lyapunov function to design the adaptive 
law for k(t). We assume that the adaptive law is of the form 

k = ie = fi(ei,x,u) (6.2.7) 


where f\ is some function to be selected, and propose 

v 'M)=l + C (6 ' 2 ' 8) 

for some 7 > 0 as a potential Lyapunov function for the system (6.2.6), 
(6.2.7). The time derivative of V along the trajectory of (6.2.6), (6.2.7) is 
given by 

V = —a m ej — ke \x + (6.2.9) 

7 

Choosing f\ = yeix, i.e., 

k = qeix = jx 2 , k( 0 ) = ko ( 6 . 2 . 10 ) 

we have 

V = —a m e\ < 0 (6.2.11) 


Analysis Because V is a positive definite function and V < 0, we have 
V € Coo , which implies that e\ . k € Too- Because ei = x. we also have 
that x G Coo and therefore all signals in the closed-loop plant are bounded. 
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Furthermore, e\ = x 6 £2 and e\ = x € Coo (which follows from (6.2.6) ) 
imply, according to Lemma 3.2.5, that ei (t) = x(t) — > 0 as t — > 00. From 
x(t) —> 0 and the boundedness of k, we establish that k(t) — * 0, u(t) — > 
0 as t — > 00. 

We have shown that the combination of the control law (6.2.3) with the 
adaptive law (6.2.10) meets the control objective in the sense that it forces 
the plant state to converge to zero while guaranteeing signal boundedness. 

It is worth mentioning that as in the simple parameter identification 
examples considered in Chapter 4, we cannot establish that k(t) converges 
to k*, i.e. , that the pole of the closed-loop plant converges to the desired one 
given by — a m . The lack of parameter convergence is less crucial in adaptive 
control than in parameter identification because in most cases, the control 
objective can be achieved without requiring the parameters to converge to 
their true values. 

The simplicity of this scalar example allows us to solve for e± = x explic- 
itly, and study the properties of k(t),x(t) as they evolve with time. We can 
verify that 


ei (*) 
k{t) 


2cc 

c + ko — a + (c — ko + a) e~ 2ct 1 ^’ 1 

c [(c + ko - a) e 2ct - (c— k 0 + a)] 

0 + (c + k 0 - a) e 2ct + (c - k 0 + a) 


(6.2.12) 


where c 2 = 7X9 + (ko — a) 2 , satisfy the differential equations (6.2.6) and 
(6.2.10) of the closed-loop plant. Equation (6.2.12) can be used to investigate 
the effects of initial conditions and adaptive gain 7 on the transient and 
asymptotic behavior of x(t), k(t). We have lirn^oo k(t) = a + c if c>0 and 
lirn^oo k(t) = a — c if c < 0, i.e., 


lim k(t) 
£—>00 



(6.2.13) 


Therefore, for xq 7^ 0, k(t) converges to a stabilizing gain whose value de- 
pends on 7 and the initial condition xo,ko. It is clear from (6.2.13) that 
the value of k oo is independent of whether ko is a destabilizing gain, i.e., 
0 < ko < a, or a stabilizing one, i.e., ko > a, as long as (ko — a) 2 is the same. 
The use of different ko, however, will affect the transient behavior as it is 
obvious from (6.2.12). In the limit as t — » 00, the closed-loop pole converges 
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H 0 ) 

Figure 6.2 Block diagram for implementing the adaptive controller 
(6.2.14). 

to — (k 0 o — a), which may be different from —a m . Because the control ob- 
jective is to achieve signal boundedness and regulation of the state x(t) to 
zero, the convergence of k(t) to k* is not crucial. 

Implementation The adaptive control scheme developed and analyzed 
above is given by the following equations: 

u = — k(t)x , k = 'yeix = jx 2 , k( 0) = ko (6.2.14) 

where x is the measured state of the plant. A block diagram for implementing 
(6.2.14) is shown in Figure 6.2. 

The design parameters in (6.2.14) are the initial parameter fco and the 
adaptive gain 7 > 0. For signal boundedness and asymptotic regulation of 
x to zero, our analysis allows ko, 7 to be arbitrary. It is clear, however, from 
( 6 . 2 . 12 ) that their values affect the transient performance of the closed-loop 
plant as well as the steady-state value of the closed-loop pole. For a given 
ko,xo / 0 , large 7 leads to a larger value of c in ( 6 . 2 . 12 ) and, therefore, 
to a faster convergence of x{t) to zero. Large 7 , however, may make the 
differential equation for k “stiff” (i.e., k large) that will require a very small 
step size or sampling period to implement it on a digital computer. Small 
sampling periods make the adaptive scheme more sensitive to measurement 
noise and modeling errors. 
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Remark 6.2.1 In the proceeding example, we have not used any reference 
model to describe the desired properties of the closed-loop system. A 
reasonable choice for the reference model would be 

= X m {R) = X m Q (6.2.15) 

which, by following exactly the same procedure, would lead to the 
adaptive control scheme 

u = —k(t)x, k = yeix 

where e\ = x — x m . If x m o / xo, the use of (6.2.15) will affect the 
transient behavior of the tracking error but will have no effect on the 
asymptotic properties of the closed-loop scheme because x m converges 
to zero exponentially fast. 

6.2.2 Scalar Example: Adaptive Tracking 

Consider the following first order plant: 

x = ax + bu (6.2.16) 

where a, b are unknown parameters but the sign of b is known. The control 
objective is to choose an appropriate control law u such that all signals in 
the closed-loop plant are bounded and x tracks the state x m of the reference 
model given by 

Xm = a m'Xm T b !n V 


i.e., 


%m, — 


(6.2.17) 


for any bounded piecewise continuous signal r(t), where a m > 0, b rn are 
known and x m (t),r(t) are measured at each time t. It is assumed that 
a m , b rn and r are chosen so that x m represents the desired state response of 
the plant. 


Control Law For x to track x m for any reference input signal r(f), the 
control law should be chosen so that the closed-loop plant transfer function 
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from the input r to output x is equal to that of the reference model. We 
propose the control law 

u = —k*x + l*r (6.2.18) 

where k* , l* are calculated so that 

Ms) = bP = _brn_ = XM ( 6 . 2 . 19 ) 

r(s) s — a + bk* s + a m r(s) 

Equation (6.2.19) is satisfied if we choose 

l* = hy, k* = a ^ (6.2.20) 

provided of course that 6 / 0, i.e. , the plant (6.2.16) is controllable. The 
control law (6.2.18), (6.2.20) guarantees that the transfer function of the 
closed-loop plant, i.e., is equal to that of the reference model. Such 
a transfer function matching guarantees that x(t) = x m (t),Vt > 0 when 
,x(0) = x m (0) or | x(t) — x m (t) | — > 0 exponentially fast when x(0) / x m (0), 
for any bounded reference signal r(t). 

When the plant parameters a,b are unknown, (6.2.18) cannot be imple- 
mented. Therefore, instead of (6.2.18), we propose the control law 

u — — k{t)x T l(t)r (6.2.21) 

where k(t),l(t) is the estimate of A ;*,/*, respectively, at time t, and search 
for an adaptive law to generate k(t),l(t) on-line. 


Adaptive Law As in Example 6.2.1, we can view the problem as an on- 
line identification problem of the unknown constants k*,l*. We start with 
the plant equation (6.2.16) which we express in terms of k * , l* by adding and 
subtracting the desired input term —bk*x + bl*r to obtain 

x = —a m x + b m r + b ( k*x — l*r + u) 


i.e., 

x= — — — r- 1 — (k*x — l*r + u) (6.2.22) 

S + CLm & &m 

Because x m = r is a known bounded signal, we express (6.2.22) in terms 

S~\~CLrn v 7 

of the tracking error defined as e = x — x m , i.e., 

b 

e = 


s + a m 


( k*x — l*r + u ) 


(6.2.23) 



322 


CHAPTER 6. MODEL REFERENCE ADAPTIVE CONTROL 


Because b is unknown, equation (6.2.23) is in the form of the bilinear 
parametric model considered in Chapter 4, and may be used to choose an 
adaptive law directly from Table 4.4 of Chapter 4. 

Following the procedure of Chapter 4, the estimate e of e is generated as 

e = - — - — b (kx — Ir + u) = — - — (0) (6.2.24) 

S i Qm S i CLjfyi 

where the last identity is obtained by substituting for the control law 


u = —k{t)x + l(t)r 


Equation (6.2.24) implies that the estimation error, defined as e\ — e — e, 
can be simply taken to be the tracking error, i.e., e± = e, and, therefore, 
there is no need to generate e. Furthermore, since e is not generated, the 
estimate b of b is not required. 

Substituting u = —k(t)x + l(t)r in (6.2.23) and defining the parameter 
errors k = k — k* ,1 = l — l * , we have 


ei 


e = 


S + On 


[—kx + Ir^j 


or 

ei = —a m e i + b [—kx Elr'j, e\ = e = x — x rn (6.2.25) 

As shown in Chapter 4, the development of the differential equation (6.2.25) 
relating the estimation error with the parameter error is a significant step 
in deriving the adaptive laws for updating k(t),l(t). We assume that the 
structure of the adaptive law is given by 


k = h {ei,x,r,u) , 1 = f 2 (ei, x, r, u) (6.2.26) 


where the functions /i, /2 are to be designed. 

As shown in Example 6.2.1, however, the use of the SPR-Lyapunov ap- 
proach without normalization allows us to design an adaptive law for k, l 
and analyze the stability properties of the closed-loop system using a single 
Lyapunov function. For this reason, we proceed with the SPR-Lyapunov ap- 
proach without normalization and postpone the use of other approaches that 
are based on the use of the normalized estimation error for later sections. 
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Consider the function 

<6 ' 2 - 27) 

where 71,72 > 0 as a Lyapunov candidate for the system (6.2.25), (6.2.26). 
The time derivative V along any trajectory of (6.2.25), (6.2.26) is given by 

V = —a m e\ — bke \x + ble\r + + ^-/ 2 (6.2.28) 

7i 72 

Because | 6 | = bsgn(b), the indefinite terms in (6.2.28) disappear if we choose 
fi = 7 iei£Sgn(&), / 2 = — 7 2 eir sgn( 6 ). Therefore, for the adaptive law 

k = 7 ieix sgn( 6 ), / = — 7 2 €ir sgn( 6 ) (6.2.29) 

we have 

V = —a m e\ (6.2.30) 

Analysis Treating x m (t),r(t) in (6.2.25) as bounded functions of time, it 
follows from (6.2.27), (6.2.30) that V is a Lyapunov function for the third- 
order differential equation (6.2.25) and (6.2.29) where x m is treated as a 
bounded function of time, and the equilibrium e\ e = e e = 0 , k e = 0 , l e = 0 
is u.s. Furthermore, e\,k,l G and ei G £ 2 . Because e\ = e = x — x rn 
and x m G /Lx, we also have x G £00 and u G Cx ; therefore, all signals in 
the closed-loop are bounded. Now from (6.2.25) we have ei G £ 00 > which, 
together with ei G £ 2 , implies that ei (t) = e(t) — > 0 as t — > 00 . 

We have established that the control law (6.2.21) together with the adap- 
tive law (6.2.29) guarantee boundedness for all signals in the closed-loop 
system. In addition, the plant state x(t) tracks the state of the refer- 
ence model x m asymptotically with time for any reference input signal r, 
which is bounded and piecewise continuous. These results do not imply 
that k(t) —> k* and l(t) — > /* as t — ► 00 , i.e. , the transfer function of the 
closed- loop plant may not approach that of the reference model as t — ► 00 . 
To achieve such a result, the reference input r has to be sufficiently rich of 
order 2. For example, r(t) = sin uit for some u 7 ^ 0 guarantees the expo- 
nential convergence of x(t) to x m (t) and of k(t),l(t) to k*,l*, respectively. 
In general, a sufficiently rich reference input r(t) is not desirable especially 
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Reference Model 



Figure 6.3 Block diagram for implementing the adaptive law (6.2.21) and 
(6.2.29). 

in cases where the control objective involves tracking of signals that are not 
rich in frequencies. Parameter convergence and the conditions the reference 
input r has to satisfy are discussed later on in this chapter. 
Implementation The MRAC control law (6.2.21), (6.2.29) can be im- 
plemented as shown in Figure 6.3. The adaptive gains 71,72 are designed 
by following similar considerations as in the previous examples. The ini- 
tial conditions 1(0), k( 0 ) are chosen to be any a priori guess of the unknown 
parameters l*,k*, respectively. Small initial parameter error usually leads 
to better transient behavior. As we mentioned before, the reference model 
and input r are designed so that x m describes the desired trajectory to be 
followed by the plant state. 

Remark 6.2.2 The assumption that the sign of b is known may be relaxed 
by using the techniques of Chapter 4 summarized in Table 4.4 and is 
left as an exercise for the reader (see Problem 6.3). 
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6.2.3 Vector Case: Full-State Measurement 

Let us now consider the nth order plant 

x = Ax + Bu, x £ lZ n (6.2.31) 

where A £ 7Z nxn ,B £ are un k nown constant matrices and {A, B) is 

controllable. The control objective is to choose the input vector u £ IZ q such 
that all signals in the closed-loop plant are bounded and the plant state x 
follows the state x m £ 1Z n of a reference model specified by the LTI system 

%m = A m X m T B m r (6.2.32) 

where A m £ 7L nxn is a stable matrix, B m £ lZ nxq , and r £ lZ q is a bounded 
reference input vector. The reference model and input r are chosen so that 
x m (t) represents a desired trajectory that x has to follow. 

Control Law If the matrices A, B were known, we could apply the control 
law 

u = —K*x + L*r (6.2.33) 

and obtain the closed-loop plant 

x = (A — BK*)x + BL*r (6.2.34) 

Hence, if K* £ IZ qxn and L* £ lZ qxq are chosen to satisfy the algebraic 
equations 

A - BK* = Am, BL* = B m (6.2.35) 

then the transfer matrix of the closed- loop plant is the same as that of the ref- 
erence model and x(t) — > x m (t) exponentially fast for any bounded reference 
input signal r(t). We should note that given the matrices A,B,A m ,B m , 
no K*,L* may exist to satisfy the matching condition (6.2.35) indicating 
that the control law (6.2.33) may not have enough structural flexibility to 
meet the control objective. In some cases, if the structure of A, B is known, 
A m ,B m may be designed so that (6.2.35) has a solution for K*,L*. 

Let us assume that K*,L* in (6.2.35) exist, i.e., that there is sufficient 
structural flexibility to meet the control objective, and propose the control 
law 


u = —K(t)x + L(t)r 


(6.2.36) 
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where K(t),L(t) are the estimates of K*,L*, respectively, to be generated 
by an appropriate adaptive law. 

Adaptive Law By adding and subtracting the desired input term, namely, 
— B(K*x—L*r ) in the plant equation and using (6.2.35), we obtain 

x = A m x + B m r + B(K*x — L*r + u) (6.2.37) 

which is the extension of the scalar equation (6.2.22) in Example 6.2.2 to 
the vector case. Following the same procedure as in Section 6.2.2, we can 
show that the tracking error e = x — x rn and parameter error K = K — K * , 
L = L — L* satisfy the equation 

e = A m e + B(— Kx + Lr) (6.2.38) 

which also depends on the unknown matrix B. In the scalar case we manage 
to get away with the unknown B by assuming that its sign is known. An 
extension of the scalar assumption of Section 6.2.2 to the vector case is as 
follows: Let us assume that L* is either positive definite or negative definite 
and T- 1 = L*sgn(/), where l = 1 if L* is positive definite and Z = — 1 if L* 
is negative definite. Then B = and (6.2.38) becomes 

e = A m e + B m L*~ 1 (—Kx + Lr) 

We propose the following Lyapunov function candidate 

V(e, K, L) = e T Pe + tr [K r TK + L t TL\ 

where P = P T > 0 satisfies the Lyapunov equation 

P d-m + A^P = — Q 
for some Q = Q T > 0. Then, 

V = —e T Qe + 2e T PB m L*~ 1 (—Kx + Lr) + 2tr [K t TK + L T rL] 


Now 

e T PB m L*~ 1 Kx = tr[x T K T TB^Pe\sgn(l) = ti-[K T TB^Pex T ]sgn{l) 
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and 

e Y PB m L*- 1 Lr = ti[L T TB^Per T ]sgn(l) 

Therefore, for 

K = K = Bj n Pex T sgii(l), L = L = —B^ n Per T sgn(l) (6.2.39) 

we have 

V = —e T Qe 

Analysis From the properties of V. V, we establish as in the scalar case 
that K(t), L{t ), e(t) are bounded and that e(t) — > 0 as t — > oo. 

Implementation The adaptive control scheme developed is given by 
(6.2.36) and (6.2.39). The matrix B m P acts as an adaptive gain matrix, 
where P is obtained by solving the Lyapunov equation PA m + Aj n P = —Q 
for some arbitrary Q = Q T > 0. Different choices of Q will not affect bound- 
edness and the asymptotic behavior of the scheme, but they will affect the 
transient response. The assumption that the unknown L* in the matching 
equation BL* = B m is either positive or negative definite imposes an ad- 
ditional restriction on the structure and elements of B,B m . Because B is 
unknown this assumption may not be realistic in some applications. 

The case where B is completely unknown is treated in [172] using the 
adaptive control law 

u = —L(t)K(t)x + L(t)r 

K = B^ n Pex T , L = —LB^evJ L (6.2.40) 

The result established, however, is only local which indicates that for stabil- 
ity K{ 0), L( 0) have to be chosen close to the equilibrium K e = K * , L e = L* 
of (6.2.40). Furthermore, K*,L* are required to satisfy the matching equa- 
tions A — BL*K* = A m , BL* = B m . 

6.2.4 Nonlinear Plant 

The procedure of Sections 6.2.1 to 6.2.3 can be extended to some special 
classes of nonlinear plants as demonstrated briefly by using the following 
nonlinear example 


x = af(x) + bg(x)u 


(6.2.41) 
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where a, b are unknown scalars, f(x),g(x ) are known functions with g(x) > 
c > 0 Vi £ TZ 1 and some constant c > 0. The sgn (b) is known and f(x) 
is bounded for bounded x. It is desired that x tracks the state x m of the 
reference model given by 


X m — &m%m b m T 

for any bounded reference input signal r. 


Control Law If a, b were known, the control law 


u = 


— [klf(x) + k£x + l*r\ 

g(x) 


(6.2.42) 


with 


K = - 


a 

V 


T’ 


T = 


bm 

T 


could meet the control objective exactly. For the case of a, b unknown, we 
propose a control law of the same form as (6.2.42) but with adjustable gains, 
i.e., we use 

u = — 1 — [ki(t)f(x) + k 2 (t) x + l(t)r] (6.2.43) 

9\ x ) 


where k\,k 2 ,l, are the estimates of the unknown controller gains kl,k 2 ,l* 
respectively to be generated by an adaptive law. 


Adaptive Law As in the previous examples, we first rewrite the plant 
equation in terms of the unknown controller gains k^, l*, i.e., substituting 

for a = —bkf and adding and subtracting the term b {k\f{x) + k%x + l*r) in 
(6.2.41) and using the equation bl* = b m ,bk 2 = —a m , we obtain 

x = —a m x + b m r + b [ —k\f(x ) — k^x — Tr + g(x)u] 

If we let e = x — x m , k\ = k± — k\, k 2 = k 2 — k%, l = l — l* to be the tracking 
and parameter errors, we can show as before that the tracking error satisfies 
the differential equation 


e = -a m e + b (hif(x) + k 2 x + Ir^j 


(i = e 
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which we can use as in Section 6.2.2 to develop the adaptive laws 

h = - 7 ie/(x) sgn( 6 ) 

k -2 = — 72 ex sgn( 6 ) (6.2.44) 

l = — 73 er sgn ( 6 ) 


where 7 * >0,* = 1,2,3 are the adaptive gains. 


Analysis We can establish that all signals in the closed-loop plant (6.2.41), 
(6.2.43), and (6.2.44) are bounded and that |e(i)| = \x{t) — x m (t)\ — > 0 as 
f — > 00 by using the Lyapunov function 

ei,fe.0^ + ^l + ^l + 4 

in a similar way as in Section 6.2.2. The choice of the control law to cancel 
the nonlinearities and force the plant to behave as an LTI system is quite 
obvious for the case of the plant (6.2.41). Similar techniques may be used 
to deal with some more complicated nonlinear problems where the choice of 
the control law in the known and unknown parameter case is less obvious 
[105], 


Remark 6.2.3 The simple adaptive control schemes presented in this sec- 
tion have the following characteristics: 

(i) The adaptive laws are developed using the SPR-Lyapunov design ap- 
proach and are driven by the estimation error rather than the normal- 
ized estimation error. The estimation error is equal to the regulation 
or tracking error that is to be driven to zero as a part of the control 
objective. 

(ii) The design of the adaptive law and the stability analysis of the closed- 
loop adaptive scheme is accomplished by using a single Lyapunov func- 
tion. 

(iii) The full state vector is available for measurement. 


Another approach is to use the procedure of Chapter 4 and develop adap- 
tive laws based on the SPR-Lyapunov method that are driven by normalized 
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estimation errors. Such schemes, however, are not as easy to analyze as the 
schemes with unnornralized adaptive laws developed in this section. The 
reason is that the normalized estimation error is not simply related to the 
regulation or tracking error and additional stability arguments are needed 
to complete the analysis of the respective adaptive control scheme. 

The distinction between adaptive schemes with normalized and unnor- 
malized adaptive laws is made clear in this chapter by analyzing them in 
separate sections. Some of the advantages and disadvantages of normalized 
and unnornralized adaptive laws are discussed in the sections to follow. 

The assumption of full state measurement in the above examples is re- 
laxed in the following sections where we formulate and solve the general 
MR AC problem. 


6.3 MRC for SISO Plants 

In Section 6.2 we used several examples to illustrate the design and analysis 
of MRAC schemes for plants whose state vector is available for measurement. 
The design of MRAC schemes for plants whose output rather than the full 
state is available for measurement follows a similar procedure as that used 
in Section 6.2. This design procedure is based on combining a control law 
whose form is the same as the one we would use in the known parameter 
case with an adaptive law that provides on-line estimates for the controller 
parameters. 

In the general case, the design of the control law is not as straightforward 
as it appears to be in the case of the examples of Section 6.2. Because of 
this reason, we use this section to formulate the MRC problem for a general 
class of LTI SISO plants and solve it for the case where the plant parameters 
are known exactly. The significance of the existence of a control law that 
solves the MRC problem is twofold: First it demonstrates that given a set of 
assumptions about the plant and reference model, there is enough structural 
flexibility to meet the control objective; second, it provides the form of the 
control law that is to be combined with an adaptive law to form MRAC 
schemes in the case of unknown plant parameters to be treated in the sections 
to follow. 
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6.3.1 Problem Statement 

Consider the SISO, LTI plant described by the vector differential equation 

Xp — ApXp B p u p , — xq 

y p = C p x p (6.3.1) 

where x p £ 7 Z n ; y p . u p £ 1Z 1 and A p , B p , C p have the appropriate dimensions. 

The transfer function of the plant is given by 

V v = G p (s)up (6.3.2) 

with G p (s ) expressed in the form 

Gp{s) = kp R^) (6 ' 3 ' 3) 

where Z p ,R p are nronic polynomials and k p is a constant referred to as the 
high frequency gain. 

The reference model, selected by the designer to describe the desired 
characteristics of the plant, is described by the differential equation 

Xm = A m X m + Bm,X) X m (0 ) = X m Q 

yin = G m X m (6.3.4) 

where x m £ TZ Pm for some integer p m ',y m ,r £ 1Z 1 and r is the reference 
input which is assumed to be a uniformly bounded and piecewise continuous 
function of time. The transfer function of the reference model given by 

Vm = W m (s)r 

is expressed in the same form as (6.3.3), i.e. , 

W m (s) = k m Z ' n ^\ (6.3.5) 

Rm{s) 

where Z m (s),R m (s ) are rnonic polynomials and k m is a constant. 

The MRC objective is to determine the plant input u p so that all signals 
are bounded and the plant output y p tracks the reference model output y m 
as close as possible for any given reference input r(f) of the class defined 
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above. We refer to the problem of finding the desired u p to meet the control 
objective as the MRC problem. 

In order to meet the MRC objective with a control law that is irnple- 
mentable, i.e. , a control law that is free of differentiators and uses only 
measurable signals, we assume that the plant and reference model satisfy 
the following assumptions: 

Plant Assumptions 

PI. Z p (s ) is a rnonic Hurwitz polynomial of degree m p 
P2. An upper bound n of the degree n p of R p (s) 

P3. the relative degree n* = n p — m p of G p (s), and 
P4. the sign of the high frequency gain k p are known 

Reference Model Assumptions: 

Ml. Zrn.i.s') > 77m(^0 are monic Huiwitz polynomials of degree QrniPtni 
respectively, where p m < n. 

M2. The relative degree n* m = p m — q m of W m (s) is the same as that 
of G p (s), i.e., n* m = n* . 

Remark 6.3.1 Assumption PI requires the plant transfer function G p {s) 
to be minimum phase. We make no assumptions, however, about the 
location of the poles of G p (s), i.e., the plant is allowed to have unstable 
poles. We allow the plant to be uncontrollable or unobservable, i.e., 
we allow common zeros and poles in the plant transfer function. Be- 
cause, by assumption PI, all the plant zeros are in C~, any zero-pole 
cancellation can only occur in C~, which implies that the plant (6.3.1) 
is both stabilizable and detectable. 

The minimum phase assumption (PI) is a consequence of the control 
objective which is met by designing an MRC control law that cancels 
the zeros of the plant and replaces them with those of the reference 
model in an effort to force the closed-loop plant transfer function from 
r to y p to be equal to W rn (s). For stability, such cancellations should 
occur in C~ which implies that Z p (s ) should satisfy assumption PI. 
As we will show in Section 6.7, assumptions P3, P4 can be relaxed at 
the expense of more complex control laws. 
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6.3.2 MRC Schemes: Known Plant Parameters 


In addition to assumptions PI to P4 and Ml, M2, let us also assume 
that the plant parameters, i.e., the coefficients of G p (s) are known exactly. 
Because the plant is LTI and known, the design of the MRC scheme is 
achieved using linear system theory. 

The MRC objective is met if u p is chosen so that the closed-loop transfer 
function from r to y p has stable poles and is equal to W m (s), the transfer 
function of the reference model. Such a transfer function matching guaran- 
tees that for any reference input signal r(t), the plant output y p converges 
to y m exponentially fast. 

A trivial choice for u p is the cascade open-loop control law 


u p = C(s)r , C(s) = 


k m Z m (s) R p (s ) 


k p R !n (s) Fjj ( s ) 
which leads to the closed-loop transfer function 


Up 

r 


km Rp kpZp . . 

~r~R- 7 R =W m (s) 

l\j T ) it 777 , ZJ'f) n 77 


(6.3.6) 


(6.3.7) 


This control law, however, is feasible only when R p (s ) is Hurwitz. Other- 
wise, (6.3.7) may involve zero-pole cancellations outside C~ , which will lead 
to unbounded internal states associated with non-zero initial conditions [95]. 
In addition, (6.3.6) suffers from the usual drawbacks of open loop control 
such as deterioration of performance due to small parameter changes and 
inexact zero-pole cancellations. 

Instead of (6.3.6), let us consider the feedback control law 


u, 




A(-) 


^2 T T7 NVp + ^3 Up + c 0 r 


A(s) 


(6.3.8) 


shown in Figure 6.4 where 


a(s) = a n - 2 (s) = [s n 2 ,s n 3 ,...,s, l] T for n > 2 
a(s) = 0 for n = 1 


c(j , 01 G TZ 1 ; 61,02 e P- n ~ l are constant parameters to be designed and A (s) 
is an arbitrary rnonic Hurwitz polynomial of degree n—1 that contains Z m (s) 
as a factor, i.e., 

A(s) = A 0 (s)Z m (s) 
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Figure 6.4 Structure of the MRC scheme (6.3.8). 


which implies that Ao(s) is monic, Hurwitz and of degree no = n — 1 — q m . 
The controller parameter vector 


e* = 


n*T n*T n* * 
U 1 i y 2 i y 3 > c 0 


g n 


2 n 


is to be chosen so that the transfer function from r to y p is equal to W m (s). 

The I/O properties of the closed-loop plant shown in Figure 6.4 are 
described by the transfer function equation 


Up = G c (s)r 


(6.3.9) 


where 


G c (a) 


CpkpZpA 2 

A [(A - ^ T a(s)) R p - kpZp (Ofa(s) + 9* 3 A)] 


(6.3.10) 


We can now meet the control objective if we select the controller param- 
eters 9 J , 02 , #3 , Cq so that the closed-loop poles are stable and the closed-loop 
transfer function G c (s) = W m (s), i.e. , 


AokpZpA 2 


A [(A - Ufa) R p - kpZp (9 fa + 0|A)] 


, Z m 

— "-"171 

JA'ID. 


(6.3.11) 


is satisfied for all s G C. Because the degree of the denominator of G c (s) is 
n p + 2n — 2 and that of R m {s) is p m < n, for the matching equation (6.3.11) 
to hold, an additional n p + 2n — 2— p m zero-pole cancellations must occur in 
G c (s). Now because Z p (s) is Hurwitz by assumption and A(.s) = Ao(s)Z m (s) 
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is designed to be Hurwitz, it follows that all the zeros of G c (s ) are stable 
and therefore any zero-pole cancellation can only occur in C~ . Choosing 

c5 = k f (6.3.12) 

Kp 

and using A(s) = Ao(s)Z m (s) the matching equation (6.3.11) becomes 

(A - Ofa} R p - k p Z p (Of a + flJA) = Z p A 0 R m (6.3.13) 


or 

OlMs)Rp(s) + kp (ofa(s) + 0£A(s)) Zp{s) = A (s)R p (s) - Z p (s)A 0 (s)R m (s) 

(6.3.14) 

Equating the coefficients of the powers of s on both sides of (6.3.14), we can 
express (6.3.14) in terms of the algebraic equation 


SO* = p 


(6.3.15) 


where 0* = 


mT z)*T a * 
“l I ” 2 i ^3 


T 


S is an (n + n p — 1) x ( 2 n — 1) matrix that 
depends on the coefficients of R pi k p Z p and A, and p is an n + n p — 1 vector 
with the coefficients of AR P — Z p AoR. m . The existence of 0* to satisfy (6.3.15) 
and, therefore, (6.3.14) will very much depend on the properties of the matrix 
S. For example, if n > n p , more than one 0* will satisfy (6.3.15), whereas if 
n = n p and S is nonsingular, (6.3.15) will have only one solution. 


Remark 6.3.2 For the design of the control input (6.3.8), we assume that 
n > n p . Because the plant is known exactly, there is no need to assume 
an upper bound for the degree of the plant, i.e., because n p is known 
n can be set equal to n p . We use n > n p on purpose in order to use 
the result in the unknown plant parameter case treated in Sections 6.4 
and 6.5, where only the upper bound n for n p is known. 

Remark 6.3.3 Instead of using (6.3.15), one can solve (6.3.13) for 0\,0* 2 , 0 3 
as follows: Dividing both sides of (6.3.13) by R p (s), we obtain 
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where Q(s) (of degree n — 1 — m p ) is the quotient and k p A* (of degree 
at most n p — 1) is the remainder of Ao R m /R p , respectively. Then the 
solution for 0*, i = 1,2,3 can be found by inspection, i.e. , 

#* T a(s) = A(s) — Z p (s)Q(s ) (6.3.16) 


dfa(s) + 0* 3 A(s) 


Q(s)Rp(s) - A 0 (s)Rm,(s) 


(6.3.17) 


where the equality in the second equation is obtained by substituting 
for A*(s) using the identity A( j^ m = Q + kp r ^ . The parameters 9*,i = 
1,2,3 can now be obtained directly by equating the coefficients of the 
powers of s on both sides of (6.3.16), (6.3.17). 


Equations (6.3.16) and (6.3.17) indicate that in general the controller 
parameters 6* , i = 1,2,3 are nonlinear functions of the coefficients of 
the plant polynomials Z p (s),R p (s ) due to the dependence of Q(s) on 
the coefficients of R p (s). When n = n p and n* = 1, however, Q(s) = 1 
and the 9*’ s are linear functions of the coefficients of Z p (s), R p (s). 


Lemma 6.3.1 Let the degrees of R p ,Z p , A,Ao and R m be as specified in 
(6.3.8). Then (i) The solution 9* of (6.3.14) or (6.3.15) always exists. 

(ii) In addition if R p , Z p are coprime and n = n p , then the solution 9* is 
unique. 

Proof Let R p = R p (s)h(s) and Z p (s) = Z p (s)h(s) and R p (s),Z p (s) be coprime, 
where hfs) is a monic polynomial of degree ro (with 0 < ro < m p ). Because Z p (s) 
is Hurwitz, it follows that h(s) is also Hurwitz. If R pi Z p are coprime, h{s) = 1, i.e., 
ro = 0. If R p , Z p are not coprime, ro > 1 and h(s) is their common factor. We can 
now write (6.3.14) as 

d^aRp + k p (92 T a + 9* 3 A)Z P = A R p - Z p A 0 R m (6.3.18) 

by canceling h(s) from both sides of (6.3.14). Because h(s) is Hurwitz, the cancella- 
tion occurs in C~ . Equation (6.3.18) leads to n p +n — r 3 — 2 algebraic equations with 
2n— 1 unknowns. It can be shown that the degree of AR p — Z p A$R m is n p + n — r'o — 2 
because of the cancellation of the term s np+n ~ r o _1 . Because R p ,Z p are coprime, 
it follows from Theorem 2.3.1 that there exists unique polynomials ao(s),bo(s) of 
degree n — 2, n p — ro — 1 respectively such that 

a 0 (s)R p (s) + b 0 (s)Z p (s ) = A (s)R p (s) - Z p (s)A 0 (s)R m (s) 


(6.3.19) 
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is satisfied for n > 2. It now follows by inspection that 

^ T a(s) = f(a)Z p (a) + a 0 (s) (6.3.20) 

and 

k p (9* 2 T a(s) + e* 3 A (s)) = —f(s)R p (s) + b 0 (s) (6.3.21) 

satisfy (6.3.18), where f(s) is any given polynomial of degree n/ = n — n p + ro — 1. 
Hence, the solution 9\, 0 2 , 0 3 of (6.3.18) can be obtained as follows: We first solve 
(6.3.19) for ao(s),bo(s). We then choose an arbitrary polynomial f(s) of degree 
nf = n — rip + ro — 1 and calculate 9f,9 2 ,9 3 from (6.3.20), (6.3.21) by equating 
coefficients of the powers of s. Because f(s) is arbitrary, the solution 9\, 0 2 , d 3 is 
not unique. If, however, n = n p and ro = 0, i.e., R p , Z p are coprime, then f(s) = 0 
and dJ T a(s) = ao(s), k p (9 2 T a(s) + 0|A(s)) = &o(s) which implies that the solution 
9l,6 2 ,9 3 is unique due to the uniqueness of ao(s), 6o(s). If n = n p = 1, then 
a(s) = 0, A(s) = 1, 91 = 9 2 = 0 and 0 3 given by (6.3.18) is unique. □ 


Remark 6.3.4 It is clear from (6.3.12), (6.3.13) that the control law (6.3.8) 
places the poles of the closed-loop plant at the roots of the polynomial 
Z p (s)A 0 (s)R m (s) and changes the high frequency gain from k p to k m 
by using the feedforward gain Cq. Therefore, the MRC scheme can be 
viewed as a special case of a general pole placement scheme where the 
desired closed-loop characteristic equation is given by 

Zp(s)A 0 (s)R m (s) = 0 

The transfer function matching (6.3.11) is achieved by canceling the 
zeros of the plant, i.e., Z p (s), and replacing them by those of the 
reference model, i.e., by designing A = Ao Z m . Such a cancellation 
is made possible by assuming that Z p (s) is Hurwitz and by designing 
Ao, Z m to have stable zeros. 

We have shown that the control law (6.3.8) guarantees that the closed- 
loop transfer function G c (s ) of the plant from r to y p has all its poles in C~ 
and in addition, G c (s ) = W m (s). In our analysis we assumed zero initial 
conditions for the plant, reference model and filters. The transfer function 
matching, i.e., G c (s) = W m (s), together with zero initial conditions guar- 
antee that y p (t) = y m (t),\/t > 0 and for any reference input r(t) that is 
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bounded and piecewise continuous. The assumption of zero initial condi- 
tions is common in most I/O control design approaches for LTI systems and 
is valid provided that any zero-pole cancellation in the closed-loop plant 
transfer function occurs in C~ . Otherwise nonzero initial conditions may 
lead to unbounded internal states that correspond to zero-pole cancellations 
in C + . 

In our design we make sure that all cancellations in G c {s ) occur in C~ 
by assuming stable zeros for the plant transfer function and by using stable 
filters in the control law. Nonzero initial conditions, however, will affect the 
transient response of y p (t). As a result we can no longer guarantee that 
y p (t) = y m (t) Vt > 0 but instead that y p (t) — > y m (t ) exponentially fast with 
a rate that depends on the closed-loop dynamics. We analyze the effect of 
initial conditions by using state space representations for the plant, reference 
model, and controller as follows: We begin with the following state-space 
realization of the control law (6.3.8): 


chi 

= Fuji + gu p , 

(0) = 0 


= Fuj 2 + gy p , 

^2(0) = 0 

Up 

= e* T uj 



where ui \ , uj 2 


t /v , 


^i T > ^2 T > c o > w — [ w i > w 2 > Upi r ] 


T , ,T 


lT 


(6.3.22) 



— A n - 2 

— A n - 3 

An— 4 • • • 

— Ao 


' 1 ' 
0 


1 

0 

0 

0 


F = 

0 

1 

0 

0 

, 9 = 


0 

0 

1 

0 


0 


A i are the coefficients of 


(6.3.23) 


A(s) = s n 1 + \ n - 2 s n 2 + . . . + Ais + Ao = det (si - F ) 


and (F, g) is the state space realization of i.e. , {si — F)~ l g = The 
block diagram of the closed-loop plant with the control law (6.3.22) is shown 
in Figure 6.5. 
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Figure 6.5 Block diagram of the MRC scheme (6.3.22). 


We obtain the state-space representation of the overall closed-loop plant 
by augmenting the state x p of the plant (6.3.1) with the states ui,U 2 of the 
controller (6.3.22), i.e., 


F c = A c Y c + B cC * 0 r, y c (0) = Yo 

Vp = CjY c 


(6.3.24) 


where 


Y r = 


Xp,uJ,uJ 


lT 


e n np 


n v -\-2n—2 



' A p + B p 0%Cj 

JD /3*T 
-DpUl 

td n*T 
UpV 2 


B p 

A c = 

S C P 

F + gdf 

0 

gof 

F 

, B c = 

g 

0 


(6.3.25) 


C'e' = 


cJ,o,o 


and Yq is the vector with initial conditions. We have already established 
that the transfer function from r to y p is given by 


Vp(s) 


c* 0 k p Z p A 2 


r(s) A [(A - Of a) R p - k p Z p (6fa + 0$ A)] 
which implies that 

Cj (si - A.)- 1 B c Cq = 


= W m (s) 


c* 0 k p Z p A 2 


A [(A - Of a) R p - k p Z p ( 6* T a + 0£A)] 


= W m (s) 
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and, therefore, 

det (si - A c ) = A [(A - Of a) R p - k p Z p (e * 2 T a + 05 A)] = AZ p A 0 R m 

where the last equality is obtained by using the matching equation (6.3.13). 
It is clear that the eigenvalues of A c are equal to the roots of the polynomials 
A, Z p and R m ] therefore, A c is a stable matrix. The stability of A c and the 
boundedness of r imply that the state vector Y c in (6.3.24) is bounded. 

Since Cj (si — A c )~ 1 B c Cq = W m (s), the reference model may be realized 
by the triple (A c , B c Cq, C c ) and described by the nonminimal state space 
representation 

lm = A c Y m + BqCqV, 4 m (0) = Y m i o 

Vm = Cjy m (6.3.26) 

where Y m € 7 z n p +2n ~ 2 . 

Letting e = Y c — Y rn to be the state error and e\ = y p — y m the output 
tracking error, it follows from (6.3.24) and (6.3.26) that 

e = A c e, e\ = Cj e 

i.e. , the tracking error e\ satisfies 

ei = Cje A ^ (y c (0) - y m (0)) 

Because A c is a stable matrix, e±(t) converges exponentially to zero. The 
rate of convergence depends on the location of the eigenvalues of A c , which 
are equal to the roots of A(s)Ao(s)R m (s)Z p (s) = 0. We can affect the rate 
of convergence by designing A(s)Ao(s)R m (s) to have fast zeros, but we are 
limited by the dependence of A c on the zeros of Z p (s), which are fixed by 
the given plant. 

Example 6.3.1 Let us consider the second order plant 

—2 (s + 5) —2 (s + 5) 

Vp ~ s 2 ~ 2s + l Up _ ( s _l) 2 Up 

and the reference model 

3 

Vm = — P r 

s + 3 
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The order of the plant is n p = 2. Its relative degree n* = 1 is equal to that of the 
reference model. 

We choose the polynomial A(s) as 

A(s) = s + 1 = A 0 (s) 

and the control input 

Up = e\ ^ + 0 *3 yp + c *o r 

which gives the closed-loop transfer function 

Vp = -2cg (s + 5) (s + 1) = Q , s 

r (s + l-0t)(s-l ) 2 + 2(s + 5)(0* + 0*(s + l)) c[S 

Forcing G c (s ) = 3/(s + 3), we have cjj = —3/2 and the matching equation (6.3.14) 
becomes 

6 >* (s - l ) 2 - 2 (02 + 0* 3 (s + 1)) (s + 5) = (s + 1) (s - l ) 2 - (s + 5) (s + 1) (s + 3) 


i.e., 

{91 - 291 ) s 2 + (- 20 i - 20 2 - 120^) s + 9 { - 10 ( 6»2 + 91 ) = -10s 2 - 24s - 14 
Equating the powers of s we have 

0 \ - 2 0 * = -10 
0i + 0; + 66I3 = 12 

9 \ - 1002 - 1003 = ~ 14 


i.e., 



which gives 



The control input is therefore given by 

u p = -4 -^—ru p - 2 -^—y P + 3 y p - 1.5r 
s + 1 s + 1 

and is implemented as follows: 


LOi = -UJ 1 + Up, U>2 = -UJ2 + Up 


Up = —4 u>i — 2 u >2 + 3 y p — 1.5r 


V 
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Figure 6.6 Implementation of the MRC scheme (6.3.28). 


Remark 6.3.5 Because 6* is a constant vector, the control law (6.3.8) may 
be also written as 

Up = ( 0 * u p) + (02 Up) + 03 Up + c o r (6.3.27) 

and implemented as 

chi = F T i0\ + 0\u p 

ch 2 = F T UJ 2 + 0*2 Up (6.3.28) 

Up = g T co i + g T LV 2 + 0lu p + c* 0 r 

where ui,u )2 £ ; F,g are as defined in (6.3.23), i.e., g T (sI — 

F t )~ l = g T ((sI-F)~ 1 ) T = uiprp The block diagram for implement- 
ing (6.3.28) is shown in Figure 6.6. 

Remark 6.3.6 The structure of the feedback control law (6.3.8) is not 
unique. For example, instead of the control law (6.3.8) we can also 



use 


(6.3.29) 
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where Ai(s) = Ao (s)Z m (s) has degree n,a(s ) = a n _i(s) = [s n *, 

n_2 ° 1lT and 9\, 6 * £ 7Z n . The overall desired controller paranr- 

, T 

2 > c 0 


S'" S, 1] T 

eter 9* = 


/)*T zd*T 


£ 7\l 2n+1 has one dimension more than that 
in (6.3.8). The analysis of (6.3.29) is very similar to that of (6.3.8) and 
is left as an exercise (see Problem 6.7) for the reader. 


Remark 6.3.7 We can express the control law (6.3.8) in the general feed- 
back form shown in Figure 6.7 with a feedforward block 


C(s) 


c*oMs) 

A(s) — 0f T a(s) 


and a feedback block 


F(s) = 


a(s) + ^3 A(s) 
A (s)c* 0 


where Cq, 6\, 9\, 9 3 are chosen to satisfy the matching equation (6.3.12), 
(6.3.13). The general structure of Figure 6.7 allows us to analyze and 
study properties such as robustness, disturbance rejection, etc., of the 
MRC scheme using well established results from linear system theory 
[57, 95]. 


We have shown that the MRC law (6.3.8), whose parameters 9*,i = 1,2,3 
and Cq are calculated using the matching equations (6.3.12), (6.3.13) meets 
the MRC objective. The solution of the matching equations for 9* and Cq 
requires the knowledge of the coefficients of the plant polynomials k p Z p (s), 
R p (s). In the following sections we combine the MRC law (6.3.8) with an 
adaptive law that generates estimates for 9 * , Cq on-line to deal with the case 
of unknown plant parameters. 


6.4 Direct MRAC with Unnormalized Adaptive 
Laws 

The main characteristics of the simple MRAC schemes developed in Section 
6.2 are 

(i) The adaptive laws are driven by the estimation error which, due to the 
special form of the control law, is equal to the regulation or tracking 
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Figure 6.7 MRC in the general feedback block diagram where C(s) = 

cgAQQ ip/ \ _ dp a(s)+8*A(s) 

A(s)-et T a(s)’ r ^> C* A(s) 

error. They are derived using the SPR-Lyapunov design approach 
without the use of normalization 

(ii) A simple Lyapunov function is used to design the adaptive law and 
establish boundedness for all signals in the closed-loop plant. 

The extension of the results of Sections 6.2, 6.3 to the SISO plant (6.3.1) 
with unknown parameters became an active area of research in the 70’s. 
In 1974, Monopoli [150] introduced the concept of the augmented error, a 
form of an estimation error without normalization, that he used to develop 
stable MRAC schemes for plants with relative degree 1 and 2, but not for 
plants with higher relative degree. Following the work of Monopoli, Feuer 
and Morse [54] designed and analyzed MRAC schemes with unnormalized 
adaptive laws that are applicable to plants with known relative degree of ar- 
bitrary positive value. This generalization came at the expense of additional 
complexity in the structure of MRAC schemes for plants with relative de- 
gree higher than 2. The complexity of the Feuer and Morse MRAC schemes 
relative to the ones using adaptive laws with normalized estimation errors, 
introduced during the same period [48, 72, 174], was responsible for their 
lack of popularity within the adaptive control research community. As a 
result, it was not until the early 1990s that the MRAC schemes designed 
for plants with relative degree higher than 2 and employing unnormalized 
adaptive laws were revived again and shown to offer some advantages over 
the MRAC with normalization when applied to certain classes of nonlinear 
plants [98, 116, 117, 118, 162], 

In this section, we follow an approach very similar to that of Feuer and 
Morse and extend the results of Section 6.2 to the general case of higher 
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order SISO plants. The complexity of the schemes increases with the relative 
degree n* of the plant. The simplest cases are the ones where n* = 1 and 2. 
Because of their simplicity, they are still quite popular in the literature of 
continuous-time MRAC and are presented in separate sections. 


6.4.1 Relative Degree n* = 1 

Let us assume that the relative degree of the plant 

y p = G p (s)u p = kp^j^Up 
is n* = 1. The reference model 


(6.4.1) 


Vm = W m (s)r 

is chosen to have the same relative degree and both G p (s),lT m (s) satisfy 
assumptions PI to P4, and Ml and M2, respectively. In addition W m (s ) is 
designed to be SPR. 

The design of the MRAC law to meet the control objective defined in 
Section 6.3.1 proceeds as follows: 

We have shown in Section 6.3.2 that the control law 


io\ = Fui\ + gu p , uq(0) = 0 

L 02 = Fuj 2 + gy P , cu 2 (0) = 0 (6.4.2) 

Up = 9 * t lo 


where co = 


CO 


vj,ypi r 


and 6* = 


Q*T /3*T n* 

h T2 


T 


calculated from the 


matching equation (6.3.12) and (6.3.13) meets the MRC objective defined in 
Section 6.3.1. Because the parameters of the plant are unknown, the desired 
controller parameter vector 9* cannot be calculated from the matching equa- 
tion and therefore (6.4.2) cannot be implemented. A reasonable approach 
to follow in the unknown plant parameter case is to replace (6.4.2) with the 
control law 


co\ = Fcoi+gUp, uq(0) = 0 

lu 2 = Fuj2+gy P , w 2 (0) = 0 

Up = 9 t co 


(6.4.3) 
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where 9(t) is the estimate of 9* at time t to be generated by an appropri- 
ate adaptive law. We derive such an adaptive law by following a similar 
procedure as in the case of the examples of Section 6.2. We first obtain a 
composite state space representation of the plant and controller, i.e. , 

Y c = AqY c + B c u p , 4^(0) = Yq 

y P = cJy c 

Up = 6 t u> 

r T T t1 t 

where Y c = x p , uq , oq 

A p 0 0 ' 

A) = 0 F 0 , B c 

_gCj 0 F 

Cj = [c p T ,o,o‘ 

and then add and subtract the desired input B c 9* t lu to obtain 

Y c = A 0 Y C + B c 9* r u + B c (u p - 9* T u?j 

If we now absorb the term B c 9* T co into the homogeneous part of the above 
equation, we end up with the representation 

Y c = A C Y C + B c c* 0 r + B c (u p - 9 * t cj) , Y c (0) = Y (J 

V P = CjY c (6.4.4) 

where A c is as defined in (6.3.25). Equation (6.4.4) is the same as the closed- 
loop equation (6.3.24) in the known parameter case except for the additional 
input term B c {u p — 9* T u) that depends on the choice of the input u p . It 
serves as the parameterization of the plant equation in terms of the desired 
controller parameter vector 9*. Let e = Y c — Y m and e\ = y p — y m where Y m 
is the state of the nonminimal representation of the reference model given 
by (6.3.26), we obtain the error equation 

e = A c e + B c {u p — 9 * t lu), e(0) = eo 

e\ = Cj e 



(6.4.5) 
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Because 

Cj{sI-A c )- l B c cl = W m {s) 

we have 

ei = W m (s)p* (u p - rV) (6.4.6) 

where p* = 4r, which is in the form of the bilinear parametric model analyzed 

c o 

in Chapter 4. We can now use (6.4.6) to generate a wide class of adaptive 
laws for estimating 9* by using the results of Chapter 4. We should note 
that (6.4.5) and (6.4.6) hold for any relative degree and will also be used in 
later sections. 

The estimate e\(t) of ei(t) based on 9(t), the estimate of 9* at time t, is 
given by 

ei = W m (s)p (u p - 9 t u?J (6.4.7) 

where p is the estimate of p* . Because the control input is given by 

Up = 9 T (t) to 

it follows that e\ = W m (s)[0]; therefore, the estimation error e± defined in 
Chapter 4 as e\ = e\ — e\ may be taken to be equal to e\, i.e., e\ = e\. 
Consequently, (6.4.7) is not needed and the estimate p of p* does not have 
to be generated. Substituting for the control law in (6.4.5), we obtain the 
error equation 


e = A c e + B c p*9 J \o, e(0) = eo 
e\ = Cje (6.4.8) 


where 

B c = B c c* 0 
or 

ei = W m (s)p*9 T co 

which relates the parameter error 9 = 9(t) — 9* with the tracking error e\. 
Because W m (s ) = Cj (si — A c )~ 1 B c Cq is SPR and A c is stable, equation 
(6.4.8) is in the appropriate form for applying the SPR-Lyapunov design 
approach. 
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We therefore proceed by proposing the Lyapunov-like function 


V 


(M 


e T P r e 0 T r~b 


Ip* 


2 2 

where T = T t > 0 and P c = Pj > 0 satisfies the algebraic equations 

P C A C + A] P c = -qq T - v c L c 
P C B C = C c 


(6.4.9) 


where q is a vector, L c = Lj > 0 and v c > 0 is a small constant, that are 
implied by the MKY lemma. The time derivative V of V along the solution 
of (6.4.8) is given by 

V = - G q 2 6 - ^e T L c e + e T P c B c p*9 T lu + 6 T T~ l 6\p*\ 

Because e T P C B C = ei and p* = |/9*|sgn(p*), we can make V < 0 by choosing 


6 = 9 = —Te sgn (p*) 


(6.4.10) 


which leads to 


V=- 


e T qq T e 


"c t T 
— — e L r e 


(6.4.11) 


Equations (6.4.9) and (6.4.11) imply that V and, therefore, e, 0 £ £oo- 

Because e = Y c — Y m and Y rn £ Coo-, w e have Y c £ Coo, which implies that 
t/p, UJ\,UJ2 £ C oo- Because Up — 0 lo and 6,uj £ Coo we also have Up £ Coo- 
Therefore all the signals in the closed-loop plant are bounded. It remains to 
show that the tracking error e\ = y p — y rn goes to zero as t — > oo. 

From (6.4.9) and (6.4.11) we establish that e and therefore e\ £ C 2 - 
Furthermore, using 6,co,e £ C oo in (6.4.8) we have that e,e\ £ Coo- Hence, 
ei,ei £ Coo and e\ £ C 2 , which, by Lemma 3.2.5, imply that e\ (t) — > 0 as 
t — > 00 . 

We summarize the main equations of the MRAC scheme in Table 6.1. 

The stability properties of the MRAC scheme of Table 6.1 are given by 
the following theorem. 


Theorem 6.4.1 The MRAC scheme summarized in Table 6.1 guarantees 
that: 
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Table 6.1 MR AC scheme: n* = 1 


Plant 

Vp ~ k PR p (s) U P’ n ~ 1 

Reference 

model 

Urn = W m (s)r, W m (s ) = km. R 2{s] 

Control law 

6j\ = Fuji + gu p , ur(0) = 0 
uj 2 = Fuj 2 + gy p , w 2 (0) = 0 
u p = 8 T t o 

UJ = [ujJ, ujJ , y p ,r\ T ,UJI G 7?™ -1 , uj 2 G IZ n ~ l 

Adaptive law 

6 = — Teiuj sgn (p*) 

ei =y P ~ y m , sgn (p*) = sgn {k p /k m ) 

Assumptions 

Zp,R p and W m (s) satisfy assumptions PI to P4, 
and Ml and M2, respectively; W m (s) is SPR; (si — 

= Xf5J’ a (' s ) = [s n_2 ,s n_3 ,---s,l] T , where 
A = Ao Z m is Hurwitz, and Ao(s) is of degree n—l—q m 
q rn is the degree of Z m (s ); T = T t > 0 is arbitrary 


(i) 


All signals in the closed-loop plant are bounded and the tracking error 
e\ converges to zero asymptotically with time for any reference input 


(ii) If r is sufficiently rich of order 2 n, f G Coo and Z p (s),R p (s ) are rela- 
tively coprime, then the parameter error |#| = \9 — 9*\ and the tracking 
error e\ converge to zero exponentially fast. 

Proof (i) This part has already been completed above. 

(ii) Equations (6.4.8) and (6.4.10) have the same form as (4.3.30) and (4.3.35) 
with n 2 s = 0 in Chapter 4 whose convergence properties are established by Corollary 
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4.3.1. Therefore, by using the same steps as in the proof of Corollary 4.3.1 we can 
establish that if ui, to G £oo and u> is PE then 6(t) — > 0 exponentially fast. If r G 
then it follows from the results of part (i) that to G C^. For the proof to be 
complete, it remains to show that u> is PE. 

We express ui as 

{sI-F)-'gG-\s)y p ' 

( S/ - F )" gy r (6.4.12) 

y P 

r 

Because y p = y m + e\ = W m (s)r + e± we have 

UI = LUm + u> (6.4.13) 



where 

and 

H{s) 


u) m = H(s)r , w = Ef 0 (s)ei 


(si - F) 1 gG p 1 (s)W m (s) 
(si — F)~ 1 gW m (s) 
W m (s ) 

1 


Ho(s) 


(. sI-F)-igG~\s ) 
(sI-Fy+g 
1 
0 


The vector u> is the output of a proper transfer matrix whose poles are stable and 
whose input ei € C 2 D and goes to zero as t — > oo. Hence, from Corollary 3.3.1 
we have u G C 2 D and |a>(t) | — > 0 as t — > oo. It then follows from Lemma 4.8.3 
that ui is PE if ui m is PE. 

It remains to show that ui m is PE when r is sufficiently rich of order 2 n. 

Because r is sufficiently rich of order 2 n, according to Theorem 5.2.1, we can 
show that ui m is PE by proving that H(jui i), H(jui 2 ), . . . ,H(ju> 2n ) are linearly in- 
dependent on C 2n for any u>i, ui 2 , ■ ■ ■ , ui 2n G R with u>i yf uij for i ^ j. 

From the definition of H(s ), we can write 


H(s) = 


cr ( s ) Rp (s) k m Zjn ( s) 

1 oc(s')kpZp(s')k rn Z rn (s ) A 1 

kpZp(s)A(s)R m (s) A(s)kpZ p (s)k m Z m (s) kpZp(s)A(s)R m (s) 


\ A (s)k p Z p (s)R m (s) J 

(6.4.14) 

Because all the elements of Hi(s) are polynomials of s with order less than or equal 
to that of A (s)Z p (s)R m (s), we can write 


Hi(s)=H 


(6.4.15) 


1 
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where l = 2n — 1 + q m is the order of the polynomial A (s)Z p (s)R m (s), q m is the 
degree of Z m (s) and H G 7 j 2 nx(z+i) ; g a constant matrix. 

We now prove by contradiction that H in (6.4.15) is of full rank, i.e., rank(H) = 
2 n. Suppose rank(H) < 2 n, i.e., there exists a constant vector C G lZ 2n with C / 0 
such that 

C T H = 0 

or equivalently 

C T Hi(s) = 0 (6.4.16) 

for all s G C. Let C = [Cj,Cj,c 3 ,C 4 ] T , where C \ , C 2 G 7 \L” - 1 ,c 3 ,C 4 € TZ 1 , then 

(6.4.16) can be written as 

C i o / '{.s)Rp[s')k rn Z rn ( < s') -\- C 2 Q{^s)kpZrp(^s')k rn Z rn ( < s') 
+c 3 A(s)k p Zp(s)k m Z rn (s) + c 4 A(s)R m (s)k p Z p {s) = 0 (6.4.17) 

Because the leading coefficient of the polynomial on the left hand side is C 4 , for 

(6.4.17) to hold, it is necessary that C 4 = 0. Therefore, 

[C'T" a(s)R p (s) + Cj a(s)k p Z p (s) + c 3 A(s)kpZ p (s)]k m Z m (s) = 0 
or equivalently 

Cj a(s)R p (s) + Cj a{s)k p Zp(s) + c 3 R{s)kpZ p (s) = 0 (6.4.18) 

Equation (6.4.18) implies that 


, Zp(s) _ Cj a(s) 

p R p (s ) c 3 A(s) + Cj a(s) 


(6.4.19) 


Noting that c 3 A(s) + Cj a(s ) is of order at most equal to n — 1, (6.4.19) contradicts 
our assumption that Z p (s), R p (s) are coprime. Therefore H must be of full rank. 

Now consider the 2 n x 2 n matrix L(u>i , . . . , io^n) = , . . . , H(ju>2n)\- 

Using (6.4.14) and (6.4.15), we can express L(wi,w 2 , • • ■ , u> 2 n) as 


(M) z (ju 2 y ■■■ (ju 2n) 1 

(jWl ) i_1 ■■■ (j^n ) 1 ^ 1 

L(u>i, . . . , u>2n) = H 

1 1 ... 1 
r i 0 

D{ju 1 ) U 

® D(ju 2 ) 

X 

0 

0 0 


0 

(6.4.20) 

0 

1 

D(ju> 2„) _ 
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where D(s ) = k p Z p (s)A(s)R m (s). Note that the matrix in the middle of the right- 
hand side of (6.4.20) is a submatrix of the Vandermonte matrix, which is always 
nonsingular for uj-i ^ W/ c , i ^ k\i,k = l,...,2n. We, therefore, conclude from 
(6.4.20) that L(u i, . . . ,ui2n) is of full rank which implies that H{ju> i), . . . ,H(jco 2n) 
are linearly independent on C 2n and the proof is complete. □ 


Example 6.4.1 Let us consider the second order plant 

k p {s + b 0 ) 

Up — / 2 i i \ U P 
(s z + a 3 s + a 0 ) 

where k p > 0,bo > 0 and A p ,6o> a ii a o are unknown constants. The desired perfor- 
mance of the plant is specified by the reference model 

1 


Using Table 6.1, the control law is designed as 

uj\ = — 2uq T Up, uq(0) — 0 
L02 = — 2u>2 + y p , <^2(0) = 0 
Up = diLOl + 02UJ2 + d 3 y p + c 0 r 

by choosing F = —2 ,g = 1 and A(s) = s + 2. The adaptive law is given by 

9 = -Te iw, 6»(0) = e 0 

where e\ — y p — ym,9 = [0\, 62 , 63 , Co] T and u> = [uJi,ix> 2 ,y P ,r\ V . We can choose 
T = diag{ 7 i} for some 7 i > 0 and obtain the decoupled adaptive law 

0i = - 'YieitJi, i = 1, - - - , 4 

where 04 = co,oj 3 = y P ,u > 4 = r ; or we can choose T to be any positive definite 
matrix. 

For parameter convergence, we choose r to be sufficiently rich of order 4. 
As an example, we select r = Ai sintuif + A. 2 sinw 2 t for some nonzero constants 
A±, A 2 , cui, u >2 with uji ^ u> 2 - We should emphasize that we may not always have 
the luxury to choose r to be sufficiently rich. For example, if the control objective 
requires r =constant in order for y p to follow a constant set point at steady state, 
then the use of a sufficiently rich input r of order 4 will destroy the desired tracking 
properties of the closed-loop plant. 

The simulation results for the MR AC scheme for the plant with b 3 = 3, ai = 3, 
a 0 = —10, k p = 1 are shown in Figures 6.8 and 6.9. The initial value of the 
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Figure 6.8 Response of the MRAC scheme for Example 6.4.1 with r(t) = 
unit step function. 


parameters are chosen as 0(0) = [3, —10, 2, 3] T . Figure 6.8(a, b) shows the response 
of the tracking error e± and estimated parameter error 9t for 7, = 1 and r = unit 
step . Figure 6.9 shows the simulation results for V = diag{2, 6, 6, 2} and r = 
0.5sin0.7t + 2cos5.9f. From Figure 6.9 (b), we note that the estimated parameters 
converge to 9* = [1, —12, 0, 1] T due to the use of a sufficiently rich input. V 


Remark 6.4.1 The error equation (6.4.8) takes into account the initial con- 
ditions of the plant states. Therefore the results of Theorem 6.4.1 hold 



parameter error tracking error 
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Figure 6.9 Response of the MRAC scheme for Example 6.4.1 with r(t) = 
0.5 sin 0.7t + 2 cos 5.9f. 

for any finite initial condition for the states of the plant and filters. 
In the analysis, we implicitly assumed that the nonlinear differential 
equations (6.4.8) and (6.4.10) with initial conditions e(0) = eo,#(0) = 
6o possess a unique solution. For our analysis to be valid, the solution 
9(t),e(t) has to exist for all t G [0, oo). The existence and uniqueness 
of solutions of adaptive control systems is addressed in [191]. 

Remark 6.4.2 The proof of Theorem 6.4.1 part (i) may be performed by 
using a minimal state-space representation for the equation 


ei = W m {s)p*0 T u 


(6.4.21) 
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rather than the nonminimal state representation (6.4.8) to develop the 
adaptive law 6 = — Te\uj. In this case we establish that e\, 0 G £oo and 
e\ G £2 by using the LKY ( instead of the MKY Lemma ) and the 
properties of a Lyapunov-like function. The boundedness of e\ implies 
that y p G £00 • 

The boundedness of ui and the rest of the signals requires the following 
additional arguments: We write uj as 

(si - F)- 1 gG~ 1 (s)y p 
(sI-F)- { gy p 

UJ — 

y P 

r 

Because y p G £00 and (si — F)~ 1 gG~ 1 (s ), (si — F)~ 1 g are proper 
(note that the relative degree of G p (s ) is 1) with stable poles, we have 
to G £ 00 - From u p = 9 t uj and 9,u> G £«,, it follows that u p G £00 • 
The proof of ei(f) — > 0 as t — ► 00 follows by applying Lemma 3.2.5 and 
using the properties ei G £ 2 , ei = sW m (s)p*9 [ lo G £ 00 • 

Remark 6.4.3 The effect of initial conditions may be accounted for by 
considering 


ei = W m (s)p*0 T u + Cj (si - A c )~ 1 e( 0) 

instead of (6.4.21). Because the term that depends on e(0) is exponen- 
tially decaying to zero, it does not affect the stability results. This can 
be shown by modifying the Lyapunov-like function to accommodate 
the exponentially decaying to zero term (see Problem 6.19). 

6.4.2 Relative Degree n* = 2 

Let us consider again the parameterization of the plant in terms of 9*, de- 
veloped in the previous section, i.e., 


e = A c e + B c [u p — 9* T u?j 
ei = Cje 


(6.4.22) 
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or 

ei = W m (s)p* (u p - r T w) (6.4.23) 

In the relative degree n* = 1 case, we are able to design W m (s) to be SPR 
which together with the control law u p = 9 t lu enables us to obtain an error 
equation that is suitable for applying the SPR-Lyapunov design method. 

With n* = 2, W rn (s) can no longer be designed to be SPR and therefore 
the procedure of Section 6.4.1 fails to apply here. 

Instead, let us follow the techniques of Chapter 4 and use the identity 
(s + Po)(s + po) -1 = 1 for some po > 0 to rewrite (6.4.22), (6.4.23) as 

e = A c e + B c (s + p 0 ) p* (uf - 0* T (fr) , e(0) = e 0 

ei = Cje (6.4.24) 


i.e., 


where B c = B c Cq, 


ei = W m (s) (s + p 0 ) p* (u f - 0* T </>) 


(6.4.25) 


1 


Uf = 


4 * — 


l 


-LO 


s + Po s + po 

and W m (s),p 0 > 0 are chosen so that W m {s) ( s + po) is SPR. 

We use p, 9, the estimate of p*,9 *, respectively, to generate the estimate 
e\ of e\ as 

ei = W m (s)(s + po)p(uf - 0 T (/)) 


If we follow the same procedure as in Section 6.4.2, then the next step is 
to choose Up so that e\ = W m (s)(s + po)[0], ei = e.\ and (6.4.24) is in the 
form of the error equation (6.4.8) where the tracking error e\ is related to 
the parameter error 9 through an SPR transfer function. The control law 
Up = O t lu (used in the case of n* = 1 ) motivated from the known parameter 
case cannot transform (6.4.23) into the error equation we are looking for. 
Instead if we choose u p so that 


Uf = 9 T cj) 


(6.4.26) 


we have e\ = VP m (s)(s + po)[0] and by substituting (6.4.26) in (6.4.24), we 
obtain the error equation 


e = A c e + B c (s + p 0 ) p*9 T (j), e(0) = e 0 

ei = Cj e 


(6.4.27) 
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or in the transfer function form 

ei = W m (s) (s + po) P*0 T 4> 

which can be transformed into the desired form by using the transformation 

e = e - B c p*6 T cj) (6.4.28) 


i.e., 

e = A c e + Bip*6 T <j), e(0) = eo 
ei = Cje (6.4.29) 

where B\ = A C B C + B c Po and Cj B c = Cj B p Cq = 0 due to n* = 2. With 
(6.4.29), we can proceed as in the case of n* = 1 and develop an adaptive 
law for 6. Let us first examine whether we can choose u p to satisfy equation 
(6.4.26). We have 

Up = (s + po) u f = (s + po) 8 t 4> 

which implies that 

Up = 0 T u> + d T cj) (6.4.30) 

Because 9 is made available by the adaptive law, the control law (6.4.30) 
can be implemented without the use of differentiators. Let us now go back 
to the error equation (6.4.29). Because 

Cj (si - A?)- 1 B, = Cj (si - Ac)- 1 B c (s + po) = W m (s) (s + p 0 ) 


is SPR, (6.4.29) is of the same form as (6.4.8) and the adaptive law for 
generating 9 is designed by considering 


V 



e T P c e 

2 


2 


Ip* 


where P c = Pj > 0 satisfies the MKY Lemma. As in the case of n* = 1, for 


9 = 9= —Tei4> s g n (p*) 


(6.4.31) 


the time derivative V of V along the solution of (6.4.29), (6.4.31) is given by 
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which implies that e,0,e i £ £oo and e, e\ £ £ 2 . Because e\ = y p — y m > we 
also have £ £ 00 . The signal vector f is expressed as 

(s/ - F)~ 1 gG~ 1 {s)y p 
{si - F)~ f gy p 

Up 
r 

by using u p = G p 1 {s)y p . We can observe that each element of f> is the 
output of a proper stable transfer function whose input is y p or r. Because 
y p ,r £ Coo we have f £ £«>• Now e, 9,(j) £ Coo imply (from (6.4.28) ) that 
e and, therefore, Y c £ Coo- Because Lu,(f>,e 1 £ £oo we have 6 £ £oo and 
u p £ Coo and therefore all signals in the closed -loop plant are bounded. 
From (6.4.29) we also have that e £ £oo, i.e. , e\ £ Coo , which, together with 
ei £ Coo FI £ 2 ; implies that ei(t) — » 0 as t — > 00 . 

We present the main equations of the overall MRAC scheme in Table 6.2 
and summarize its stability properties by the following theorem. 

Theorem 6.4.2 The MRAC scheme of Table 6.2 guarantees that 

(i) All signals in the closed-loop plant are bounded and the tracking error 
e\ converges to zero asymptotically. 

(ii) If R p , Z p are coprime and r is sufficiently rich of order 2 n, then the 
parameter error \0\ = \9 — 0*\ and the tracking error e± converge to 
zero exponentially fast. 

Proof (i) This part has been completed above. 

(ii) Consider the error equations (6.4.29), (6.4.31) which have the same form as 
equations (4.3.10), (4.3.35) with n 2 s = 0 in Chapter 4. Using Corollary 4.3.1 we have 
that if £ Coo and <f> is PE, then the adaptive law (6.4.31) guarantees that \6\ 
converges to zero exponentially fast. We have already established that y p , <j> £ Coo- 
It follows from (6.4.32) and the fact that ei and therefore y p £ Coo that £ Too- 
Hence it remains to show that f is PE. 

As in the case of n* = 1 we write <\> as 

f = <t>m + <f> 


(6.4.32) 
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Table 6.2 MR AC scheme: n* = 2 


Plant 

Up - k PRp(s) U P’ n — 2 

Reference 

model 

Urn = w m (s)r, W m (s) = 

Control law 

tci = Fuji + gu p , u;i(0) = 0 

U) 2 = Fuj 2 + gy p , w 2 (0) = 0 
<t> = -p 0 (p + uj, 4>(0) = 0 

Up = 8 t lo + 9 T c(> = 6 t uj — (j) T T(j)ei sgn (k p /k m ) 
to = [uj , loJ , y p ,r] T , u>i G 1Z n ~ l , u o 2 G R n ~ l 

Adaptive law 

6 = -Tei((>sgn(k p /k m ), e\ = y p - y m 

Assumptions 

Z p (s ) is Hurwitz; VP m (s)(s+po) is strictly proper and 
SPR; F, g,T are as defined in Table 6.1; plant and 
reference model satisfy assumptions PI to P4, and 
Ml and M2, respectively 


where 


■P o 


{si - F)-'gG-\s)W m {s) 
(si - F)- { gW m {s) 
W m (s) 

1 


and 


-P o 


(si ~ F)~ 1 gG~ 1 (s) 
(si - F)- { g 
1 
0 


Because ei e £ 2 H Coa and e i — > 0 as t — > oo it follows (see Corollary 3.3.1) that 
</> G C -2 n £oc and |^| — > 0 as t — > oo. 
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Proceeding as in the proof of Theorem 6.4.1, we establish that <j> m is PE and use 
Lemma 4.8.3 to show that <j> is also PE which implies, using the results of Chapter 4, 
that \6\ — > 0 exponentially fast. Using (6.4.29) and the exponential convergence of 
\9\ to zero we obtain that ei converges to zero exponentially fast. □ 


Example 6.4.2 Let us consider the second order plant 

kp 

Vp = 7 2 i i \ u p 
(s z + ai s + a 0 ) 

where k p > 0, and ai , ao are constants. The reference model is chosen as 

5 


Using Table 6.2 the control law is designed as 


W 1 = — 2u>i + Up 

(1>2 = — 2^2 + Up 

(f> = — < j> + w 

u p = 0 T w — (jVTipei 

where u> = [wi, w 2 , y p , r] T ,ei = y p - y m ,Po = 1, A(s) = s + 2 and is SPR. 

The adaptive law is given by 

9 — —Teicf) 

where T = T t > 0 is any positive definite matrix and 9 = [61,62, 63, co] T . 

For parameter convergence, the input u v is chosen as 

u p = Ai sin wit + A 2 sinw 2 t 

for some A 1 ,A 2 ^ 0 and wi ^ u> 2 . 

Figures 6.10 and 6.11 show some simulation results of the MR AC scheme for 
the plant with ai = 3, do = — 10, k p = 1. We start with an initial parameter vector 
0(0) = [3, 18, —8, 3 ] t that leads to an initially destabilizing controller. The tracking 
error and estimated parameter error response is shown in Figure 6.10 for T = 
diag{2, 4, 0.8, 1}, and r = unit step. Due to the initial destabilizing controller, the 
transient response is poor. The adaptive mechanism alters the unstable behavior of 
the initial controller and eventually drives the tracking error to zero. In Figure 6.11 
we show the response of the same system when r = 3sin4.9t + 0.5cos0.7t is a 
sufficiently rich input of order 4. Because of the use of a sufficiently rich signal, 6(t) 
converges to 9* = [1, 12, —10, 1] T . V 
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(a) 



Figure 6.10 Response of the MRAC scheme for Example 6.4.2 with r(t) = 
unit step function. 


Remark 6.4.4 The control law (6.4.30) is a modification of the certainty 
equivalence control law u p = 9 T iv and is motivated from stability con- 
siderations. The additional term 9 T cj) = —cj^Tcpei sgn (p*) is a nonlin- 
ear one that disappears asymptotically with time, i.e., u p = 9 T u) + 9 T (j) 
converges to the certainty equivalence control law as f — > oo. The num- 
ber and complexity of the additional terms in the certainty equivalence 
control law increase with the relative degree n* as we demonstrate in 
the next section. 
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(a) 



Figure 6.11 Response of the MRAC scheme for Example 6.4.2 with 
r(t) = 3 sin AM + 0.5 sin 0.7 1. 


Remark 6.4.5 The proof of Theorem 6.4.2 may be accomplished by using 
a minimal state space realization for the error equation 

ei = W m (s)(s + p 0 )p*d T (p 

The details of such an approach are left as an exercise for the reader. 

6.4.3 Relative Degree n* = 3 

As in the case of n* = 2, the transfer function W m (s) of the reference 
model cannot be chosen to be SPR because according to assumption (M2), 
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W m (s) should have the same relative degree as the plant transfer function. 
Therefore, the choice of u p = 9 t lu in the error equation 

ei = Wm{s) P *{u p - 0* t lu) (6.4.33) 

will not lead to the desired error equation where the tracking error is related 
to the parameter error through an SPR transfer function. As in the case 
of n* = 2, let us rewrite (6.4.33) in a form that involves an SPR transfer 
function by using the techniques of Chapter 4, i.e. , we express (6.4.33) as 

ei = W m (s)(s + p 0 )(s + px)p* (uf - 9* T (f>^J (6.4.34) 

where 

1 , _ 1 
Uf (s + p 0 )(s + pl) Up, ( s + p 0 ){s + p 1 ) UJ 

and W m (s),po,pi are chosen so that W m (s) = 1 / P m (s)(s + Po)(s + p\) is 
SPR, which is now possible because the relative degree of W m (s) is 1. For 
simplicity and without loss of generality let us choose 

Wm<yS ^ ( s + p 0 )( s + pi )( s + ^ 0 ) 

for some qo > 0 so that 

e\ = — — — p*(v.f — 9* T (/)) (6.4.35) 

The estimate of e\ of e\ based on the estimates p. 9 is given by 

ei = — — — p(uf — 9 T (j) ) (6.4.36) 

If we proceed as in the case of n* = 2 we would attempt to choose 

u f = 9 T (t> (6.4.37) 



(6.4.38) 
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The adaptive law 

8 = -rei</>sgn(p*) (6.4.39) 

will then follow by using the standard procedure. Equation (6.4.37), how- 
ever, implies the use of the control input 

Up = {s + po)(s + Pi)uf = {s + p 0 )(s + pi)0 T (/> (6.4.40) 

which involves 8, that is not available for measurement. Consequently the 
control law (6.4.40) cannot be implemented and the choice of Uf = 8 T (j> is 
not feasible. 

The difficulty of not being able to extend the results for n* = 1, 2 to n* > 
3 became the major obstacle in advancing research in adaptive control during 
the 1970s. By the end of the 1970s and early 1980s, however, this difficulty 
was circumvented and several successful MRAC schemes were proposed using 
different approaches. Efforts to extend the procedure of n* = 1,2 to n* >3 
continued during the early 1990s and led to new designs for MRAC . One such 
design proposed by Morse [164] employs the same control law as in (6.4.40) 
but the adaptive law for 8 is modified in such a way that 9 becomes an 
available signal. This modification, achieved at the expense of a higher-order 
adaptive law, led to a MRAC scheme that guarantees signal boundedness 
and convergence of the tracking error to zero. 

Another successful MRAC design that has it roots in the paper of Feuer 
and Morse [54] is proposed in [162] for a third order plant with known high 
frequency gain. In this design, the adaptive law is kept unchanged but the 
control law is chosen as 

u p = 8 t lo + u a 

where u a is designed based on stability considerations. Below we present 
and analyze a very similar design as in [162] . 

We start by rewriting (6.4.35), (6.4.36) as 

ei = — - — p*(6 T (j) + r 0 ), ei = — - — pro (6.4.41) 

s + qo s + q 0 

where ro = Uf — $ T <j> and 9 = 8 — 8*. 

Because ro cannot be forced to be equal to zero by setting uj = 9 T (f>, we 
will focus on choosing u p so that ro goes to zero as t — > oo. In this case, 
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the estimation error e\ = e\ — e± is not equal to e± because e.\ / 0 owing to 
pro / 0. However, it satisfies the error equation 

ei = ei-ei = — 1 — (p*6 T 4>- pr 0 ) (6.4.42) 

s + qo 

that leads to the adaptive law 


6 = -T ei4>sgn(p*) , p = 'ye ir 0 (6.4.43) 

where T = T t and 7 > 0 by considering the Lyapunov-like function 


5T t^-1 


2 2 


r ~ l e 


Ip* Id- 


27 


We now need to choose u a in u p = d T ui + u a to establish stability for the 
system (6.4.41) to (6.4.43). Let us now express ro as 


ro = Uf — 9'(p = 


1 


s + po L 


ui — 9 T (j) — 6 J (f> 1 


where 


i.e., 


ui = — - — u p , (j)i = (s + po)4> = — - — u 
s + p 1 s + p\ 


ro = ~Poro + ui- 9 T cp - 0 T ep 1 
Substituting for 9 , we obtain 

ro = ~Po r o + ui + (p T P(pei sgn (p*) - 9 T (pi 


(6.4.44) 

(6.4.45) 


If we now choose u\ = — 0 T r<()eisgn( / o*) + 9 T cpi then ? % o = — Po r o and ?’o 
converges to zero exponentially fast. This choice of u \ , however, leads to a 
control input u p that is not implenrentable since u p = (s + pi)iii will involve 
the first derivative of ui and, therefore, the derivative of ei that is not 
available for measurement. Therefore, the term cp T T(pei sgn(p*) in (6.4.45) 
cannot be eliminated by u 1 . Its effect, however, may be counteracted by 
introducing what is called a “nonlinear damping” term in ui [99]. That is, 
we choose 

ui = 9 T (pi — ao (p) T T(p^j ?’o (6.4.46) 
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where cto > 0 is a design constant, and obtain 


r 0 = - 


Po + «o 


r 0 + < /> T r < />eisgn( / o*) 


The purpose of the nonlinear term (ffTf)' 2 is to “damp out” the possible 
destabilizing effect of the nonlinear term <^ T r^ei as we show in the analysis 
to follow. Using (6.4.46), the control input u p = ( s A Pi)u\ is given by 

Up = 9 t lo A 6 t 4 > i - (s + pi)ao(</> T r(/>) 2 ro (6.4.47) 


If we now perform the differentiation in (6.4.47) and substitute for the 
derivative of ro we obtain 

u p = 6 t lo A 0 T (j)\ - Aa 0 (/) T T(f) ?’ 0 - a 0 (pi - po) r 0 

+a 2 0 r 0 - a 0 (4> T T(j^ eisgn (p*) (6.4.48) 

where (j) is generated from 


(s + p Q )(s + pi) 


UJ 


which demonstrates that u p can be implemented without the use of differ- 
entiators. 

We summarize the main equations of the MRAC scheme in Table 6.3. 
The stability properties of the proposed MRAC scheme listed in Table 
6.3 are summarized as follows. 


Theorem 6.4.3 The MRAC scheme of Table 6.3 guarantees that 

(i) All signals in the closed-loop plant are bounded and ro(t), e\ (t) — > 0 as 
t — > oo. 

(ii) If k p is known, r is sufficiently rich of order 2 n and Z p , R p are coprime, 
then the parameter error \9\ = \6 — 9*\ and tracking error e\ converge 
to zero exponentially fast. 

(iii) If r is sufficiently rich of order 2n and Z p ,R p are coprime, then |0| and 
e\ converge to zero asymptotically (not necessarily exponentially fast). 

(iv) The estimate p converges to a constant p asymptotically independent of 
the richness of r. 
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Table 6.3 MRAC scheme: n* = 3 


Plant 

Vv ~ k PR P (s) u P’ n — 3 

Reference 

model 

Vm = W m (s)r 

Control law 

chi = Fuji + gu p , cji(O) = 0 

ch 2 = Fuj 2 + gy p , w 2 ( 0 ) = 0 

fo = ~{Po + aoO T r</>) 2 )r 0 + 0 T r0eisgn(p*) 

u p = 0 T u + u a 

u a = 0 T (j) i- ao(pi-po) (</> T r </>) 2 r o -4a O (/> T r(/>(0 T r0)ro 
+al((f Y T(t)) i ro - a 0 (^ T r</>) 3 eisgn(/ 9 *) 

Adaptive law 

0 = -rei^>sgn(p*), p = 7 eir 0 
ei = ei ei, e x = s l qQ pro 

4> = {s+po ) {s+pi) u, uj=[ujJ,ujJ, y p , r] 

4*1 — s _) - Pl ^i e l — Vp Vm 

Design 

variables 

T = r T > 0, 7 > 0 , ao > 0 are arbitrary design con- 
stants; W m (s)(s + po)(s + p\) is strictly proper and 
SPR; F,g are as in the case of rT = 1; Z p (s),R p (s) 
and W m {s) satisfy assumptions PI to P4, Ml and M2, 
respectively; sgn(p*) = sgn (k p /k m ) 


Proof (i) The equations that describe the stability properties of the closed-loop 
plant are 

ei = + P*0 T (t>~ pro 

fo = -(Po + ao(^ T r</>) 2 )r 0 + ^ T r^ei sgn(p*) 

9 = -Tei</> sgn(p*), p = yeiro 


(6.4.49) 
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We propose the Lyapunov-like function 


V = 


2 


+ \ P *\e T 



2 7 


+ 7o 


r 


2 

0 


2 


where 70 > 0 is a constant to be selected. The time derivative of V along the 
trajectories of (6.4.49) is given by 

V = -g o e?-7oWo-7oao?'o ( < / ,Tr ^) 2 + 7oeiro0 T r^sgn(p*) 

< -qo4 - 70W0 - loa-orl^Tcj)) 2 + To | ei | |r 0 |(/> T r</> 


By completing the squares we obtain 


<7o 


2 


I e i I 


■ 7o- 


\r 0 \(t) T T(t> 


<7o 


< -9oy -7oPo»o - 


OtQ 


7o 

2<7o 


+ 7 ° Sjo 
7o 7 ’o(<^ T r</>) 2 


7oWo - 7o a o r o(^ Tr ^) 2 


Because 70 > 0 is arbitrary, used for analysis only, for any given ou and qo > 0, we 
can choose it as 70 = 2 ao< 7 o leading to 

e? 2 

V <-q 0 — - 7oPo r o < 0 

Hence, ei,ro,p,9 E £30 and ei,ro E £ 2 - Because p, Tq E £ 00 , it follows from 
(6.4.41) that ei G £,,0 which implies that ei = ei + ey G £oo. Hence, y p G £oo, 
which, together with 


1 

(s + p 0 )(s + pi) 


{si ~ F)~ 1 gG~ 1 (s)y p 
( sI-F ) i gy p 
Vp 


(6.4.50) 


implies that ^ G Coo- Using ei ,<j> G £00 and ei G £2 in (6.4.49) we have 9,p £ 
Loo f) £ 2 - From the error equation (6.4.49) it follows that ei and, therefore, y p G £00 
which imply that <f> and <j> 1 G Coo- The second derivative e\ can be shown to be 
bounded by using 9,(j>,fo G £00 hr (6.4.41). Because e\ = y p — s 2 W m (s)r and 
s 2 W m (s) is proper, it follows that y p G £oo, which, together with (6.4.50), implies 
that (j> G Coo- Because u> = (s + po)(s + Pi)<t>, we have that u> G Coo and therefore 
u p and all signals are bounded. Because ro G Coo{^\C -2 and p G Coo, it follows 
from (6.4.41) that ei G Coo f) C 2 - which, together with ei G Coo 0 ^ 2 , implies that 
ei G Coo f) Ci- Because ei G Coo and ei G £ 2 , it follows that e\{t) — > 0 as t — > 00 
and the proof of (i) is complete. From ro G £ 2,^0 G £00 we also have ro(t) — > 0 as 
t — > 00 . 



6.4. DIRECT MRAC WITH UNNORMALIZED ADAPTIVE LAWS 369 


(ii) First, we show that 4> is PE if r is sufficiently rich of order 2 n. Using the 
expression (6.4.50) for </> and substituting y p = W m r + e\, we can write 

= fim + 4 > 


where 


and 


(s+Po)(s+Pl) 


(si - F)~ 1 gG~ 1 W m 
(sI-F)~ 1 gW m 
W m 

1 


(s + Po)(s + Pi) 


(si — F)~ 1 gG~ 1 
(si ~ F)~ 1 g 
1 
0 


Using the same arguments as in the proof of Theorem 6.4.2, we can establish that 
</> m is PE provided r is sufficiently rich of order 2 n and Z p , R p are coprime. Then 
the PE property of <p follows immediately from Lemma 4.8.3 and ei £ £ 2 - 
If k p is known, then p = 0 and (6.4.49) is reduced to 


<T = -q 0 ei+ p*9 T cj) 

9 = — rei</>sgn(p*) (6.4.51) 


We can use the same steps as in the proof of Corollary 4.3.1 in Chapter 4 to show 
that the equilibrium = 0, 9 e = 0 of (6.4.51) is e.s. provided <fi is PE and 
(j), (j) £ £oo. Since we have established in (i) that <j>, <j> £ C^, (ii) follows. 

(iii) When k p is unknown, we have 

U = -q 0 e 1 + p*6 T (j)- pr 0 

9 = —Tei(j)Sgn(p*) (6.4.52) 


We consider (6.4.52) as a linear-time- varying system with ei, 6 as states and pro as 
the external input. As shown in (ii), when pro = 0 and <f> is PE, the homogeneous 
part of (6.4.52) is e.s. We have shown in (i) that p € Coo, U) G Coo f) ^2 and r 0 (t) —> 
0 as t — » 00 . Therefore, it follows (by extending the results of Corollary 3.3.1) that 
ei , 9 € Coo fl ^2 that (t),9(t) — > 0 as t — > 00 . 

(iv) Because ei,ro G C 2 we have 



< 

< 


7 / |ei||r 0 |dr 
Jo 


7( 


&t)H[ c 

Jo 


rgdr ) 2 < 00 
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which for t — > oo implies that p = p £ L i and therefore p, p converge to a constant 
as t — > oo. □ 


Example 6.4.3 Let us consider the third order plant 


yp — 3 i 2 i i u p 

S' 3 + a 2 s z + a\s + a 0 

where fc p ,ao,ai,a 2 are unknown constants, and the sign of k p is assumed to be 
known. The control objective is to choose u p to stabilize the plant and force the 
output y p to track the output y m of the reference model given by 

Vm = (7+2)^ r 

Because n* = 3, the MR.AC scheme in Table 6.3 is considered. We choose pi = po = 
2 so that W m (s)(s+pi)(s+po) = 775 SPR. The signals to,4>, </>i are generated as 



' -10 

-25 ' 

u> 1 + 

' 1 ' 


wi(0) = 

' 0 ' 

d>l = 

1 

0 

0 

Up ? 

0 


' -10 

-25 

U>2 + 

' 1 


w 2 (0) = 

' 0 ' 

U>2 = 

1 

0 

0 

y P , 

0 




r ,T . , 

T 

iT 





u> = 


2 ,Vp 





1 


1 


-u, <p = 


T s + 2 T s + 2 

by choosing A(s) = (s + 5) 2 . Then, according to Table 6.3, the adaptive control law 
that achieves the control objective is given by 

■Wp = 0 T tu — eiSgn(fc p )</) T r</) 1 — 4a O (/) T r(/)(0 T r^)ro+ag(0 T r^) 4 ro — ao( < / >T P < / > ) 3e i s gn(/cp) 

6 = -Tei^sgn(fcp), p = jti r 0 


£i — ei — ei, ei — y p 
and rg is generated by the equation 


ei = 


s T 2 


pro 


r 0 = -(2 + a o (0 T I» 2 )r o + 0 T T</>eiSgn(fcp) 


The simulation results of the MRAC scheme for a unit step reference input are 
shown in Figure 6.12. The plant used for simulations is an unstable one with transfer 
function G p (s ) = s 3_|_ 6s 2^ 3s _ 10 • The initial conditions for the controller parameters 
are: 9q = [1.2, —9, 31, 160, —50, 9] T , and p(0) = 0.2. The design parameters used 
for simulations are: T = 50J, 7 = 50, a 0 = 0.01. V 



tracking error 
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Figure 6.12 Response of the MRAC scheme for Example 6.4.3 with r(t)= 
unit step function. 


Remark 6.4.6 The effect of initial conditions can be taken into account by 
using 

ei = W m (s)p*(u p - 6* t lu) + Cj (si - AJ-'ei 0) 

instead of (6.4.33). The proof can be easily modified to take care of 
the exponentially decaying to zero term that is due to e(0) / 0 (see 
Problem 6.19). 

Similarly, the results presented here are valid only if the existence 
and uniqueness of solutions of (6.4.49) can be established. For further 
discussion and details on the existence and uniqueness of solutions of 
the class of differential equations that arise in adaptive systems, the 
reader is referred to [191]. 
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Remark 6.4.7 The procedure for n* = 3 may be extended to the case of 
n* > 3 by following similar steps. The complexity of the control input 
Up, however, increases considerably with n* to the point that it defeats 
any simplicity we may gain from analysis by using a single Lyapunov- 
like function to establish stability. In addition to complexity the highly 
nonlinear terms in the control law may lead to a “high bandwidth” 
control input that may have adverse effects on robustness with respect 
to modeling errors. We will address some of these robustness issues in 
Chapter 8. On the other hand, the idea of unnornralized adaptive laws 
together with the nonlinear modification of the certainty equivalence 
control laws were found to be helpful in solving the adaptive control 
problem for a class of nonlinear plants [98, 99, 105]. 


6.5 Direct MRAC with Normalized Adaptive 
Laws 

In this section we present and analyze a class of MRAC schemes that domi- 
nated the literature of adaptive control due to the simplicity of their design 
as well as their robustness properties in the presence of modeling errors. 
Their design is based on the certainty equivalence approach that combines a 
control law, motivated from the known parameter case, with an adaptive law 
generated using the techniques of Chapter 4. The adaptive law is driven by 
the normalized estimation error and is based on an appropriate parameteri- 
zation of the plant that involves the unknown desired controller parameters. 
While the design of normalized MRAC schemes follows directly from the 
results of Section 6.3 and Chapter 4, their analysis is more complicated than 
that of the unnornralized MRAC schemes presented in Section 6.4 for the 
case of n* = 1,2. However, their analysis, once understood, carries over to 
all relative degrees of the plant without additional complications. 

6.5.1 Example: Adaptive Regulation 

Let us consider the scalar plant 


x = ax + u, x(0) = xq 


(6.5.1) 
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where a is an unknown constant and —a m is the desired closed-loop pole for 
some a m > 0. 

The desired control law 

u = —k*x, k* = a + a m 

that could be used to meet the control objective when a is known, is replaced 
with 

u = —k(t)x (6.5.2) 

where k(t) is to be updated by an appropriate adaptive law. In Section 
6.2.1 we updated k(t) using an unnornralized adaptive law driven by the 
estimation error, which was shown to be equal to the regulation error x. 
In this section, we use normalized adaptive laws to update k(t). These are 
adaptive laws driven by the normalized estimation error which is not directly 
related to the regulation error x. As a result, the stability analysis of the 
closed- loop adaptive system is more complicated. 

As shown in Section 6.2.1, by adding and subtracting the term —k*x in 
the plant equation (6.5.1) and using k* = a + a m to eliminate the unknown 
a, we can obtain the parametric plant model 


x = —a m x + k*x + u 


whose transfer function form is 

x = — - — ( k*x + u) (6.5.3) 

S + dm 

If we now put (6.5.3) into the form of the general parametric model 
z = W(s)6 * T i/j considered in Chapter 4, we can simply pick any adaptive 
law for estimating k* on-line from Tables 4.1 to 4.3 of Chapter 4. Therefore, 
let us rewrite (6.5.3) as 

z = — - — k*x (6.5.4) 

s + a m 

where z = x 1 — u is available from measurement. 

s+a m 

Using Table 4.1 of Chapter 4, the SPR-Lyapunov design approach gives 
the adaptive law 
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e = z — z en 2 (6.5.5) 

s + a m 

~ 1 , 2 2 

z = kx, n s = x 

s + a m 

where 7 > 0 is the adaptive gain and L(s) in Table 4.1 is taken as L(s) = 1. 

Rewriting (6.5.4) as z = k *(f>,4> = ^ x, we use Table 4.2(A) to obtain 
the gradient algorithm 

k = 7 e<f> 

e = m 2 = 1 + 4> 2 (6.5.6) 

m - 

4> = x, z = kcj), 7 > 0 

s + a m 

and from Table 4.3(A), the least-squares algorithm 

p 2 (b 2 

k = pecj), p= 5-, p(0) > 0 

m z 

e = — k-, z = k(j>, mi 2 = 1 + 4> 2 , <f> = x (6.5.7) 

m z s + a m 

The control law (6.5.2) with any one of the three adaptive laws (6.5.5) 
to (6.5.7) forms an adaptive control scheme. 

We analyze the stability properties of each scheme when applied to the 
plant (6.5.1) as follows: We start by writing the closed-loop plant equation 
as 

x = — - — (—kx) (6.5.8) 

s + cim 

by substituting u = — kx in (6.5.3). As shown in Chapter 4 all three adaptive 
laws guarantee that k E Coo independent of the boundedness of x, u, which 
implies from (6.5.8) that x cannot grow or decay faster than an exponential. 
However, the boundedness of k by itself does not imply that x 6 Coo , let 
alone x(t) — > 0 as t —+ 00. To analyze (6.5.8), we need to exploit the 
properties of kx by using the properties of the specific adaptive law that 
generates k(t). 

Let us start with the adaptive law (6.5.5). As shown in Chapter 4 using 
the Lyapunov-like function 
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and its time derivative 


V = —a m e 2 — e 2 n 2 < 0 

the adaptive law (6.5.5) guarantees that e, k £ £ 0 0 and e,en s ,k £ £2 in- 
dependent of the boundedness of x. The normalized estimation error e is 
related to kx through the equation 

e = z — z — en 2 = — - — (—kx — en 2 ) (6.5.9) 

s + a m s + a m 

where n 2 = x 2 . Using (6.5.9) and en 2 = en s x in (6.5.8), we obtain 

1 2 1 , 

x = e H en s = e H en s x (6.5.10) 

s + CL m s + a m 

Because e £ f) £2 and en s £ £2 the boundedness of x is established 

by taking absolute values on each side of (6.5.10) and applying the B-G 
lemma. We leave this approach as an exercise for the reader. 

A more elaborate but yet more systematic method that we will follow in 
the higher order case involves the use of the properties of the C 25 norm and 
the B-G Lemma. We present such a method below and use it to understand 
the higher-order case to be considered in the sections to follow. 


Step 1 . Express the plant output y (or state x) and plant input u in 
terms of the parameter error k. We have 


x = 


s T a 


(—kx), u = (s — a)x = — —(—kx) 


s + a n 


(6.5.11) 


The above integral equations may be expressed in the form of algebraic 
inequalities by using the properties of the £25 norm || (-)* I| 2 < 5 j which for sim- 
plicity we denote by || • ||. 

We have 

j|x|| < c||for||, 1M| < c\\kx\\ (6.5.12) 


where c > 0 is a generic symbol used to denote any finite constant. Let us 
now define 

m 2 f = 1 + \\x \\ 2 + \\u \\ 2 (6.5.13) 

The significance of the signal mj is that it bounds |x|, |i| and |it| from above 
provided k £ £oo. Therefore if we establish that rrif £ then the bounded- 
ness of all signals follows. The boundedness of \x\/rrif, \x\/mf, \u\/rrif follows 



376 


CHAPTER 6. MODEL REFERENCE ADAPTIVE CONTROL 


from k G Toe and the properties of the £ 25 -nornr given by Lemma 3.3.2, i.e. , 
from (6.5.11) we have 


and 


Similarly, 


\x{t)\ 

m s 


< 


1 


s + a r 


\k\"^"<c 
26 rn f 


|i(t)| <a m M + \k\m< c 


rn f 


m f 


rn f 


\u(t)\ . \x\ 

L -^i < k — < c 
mf mf 


Because of the normalizing properties of mf, we refer to it as the fictitious 
normalizing signal. 

It follows from (6.5.12), (6.5.13) that 


m 2 f < 1 + c||A;x’|| 2 


(6.5.14) 


Step 2. Use the Swapping Lemma and properties of the €.25 norm to 
upper bound ||fcx|| with terms that are guaranteed by the adaptive law to have 
finite €2 gains. We use the Swapping Lemma A. 2 given in Appendix A to 
write the identity 

kx = (l — — kx H — — kx = — - — (kx + kx) H — — kx 

\ S ~\~ Q(q / S + C^o S + C^o S + Q!q 


where ckq > 0 is an arbitrary constant. Since, from (6.5.11), kx 
we have 


kx 


1 

S + Qq 


( kx + kx) — «o 


(s + a m ) 
(s + «o) X 


which imply that 


-(s+a m )x, 

(6.5.15) 


cx < 


1 


s + q;q 


oo(5 


(\\kx\\ + \\kx\\) + a 0 


s + «o 


X 


oo<5 


For a 0 > 2 a m > 6, we have Hjq^HocS = 2^5 < therefore, 
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where c = ||f^||oo<5- Since ^ <E £oo, it follows that 

\\kx\\ < —(\\km.f\\ + II&to/H) + aoc||x|| (6.5.16) 

a o 

Equation (6.5.16) is independent of the adaptive law used to update k(t). 
The term in (6.5.16) is “small” because k E C 2 (guaranteed by any 

one of the adaptive laws (6.5.5) - (6.5.7)), whereas the term can 

be made small by choosing cto large but finite. Large ao, however, may 
make aocIMI large unless ||x|| is also small in some sense. We establish 
the smallness of the regulation error x by exploiting its relationship with 
the normalized estimation error e. This relationship depends on the specific 
adaptive law used. For example, for the adaptive law (6.5.5) that is based 
on the SPR-Lyapunov design approach, we have established that 

1 2 

x = e H en s 

s + a m 

which together with |en^| < \en s \j^mf < cen s mf imply that 

IM| < ||e|| + c\\en s mf\\ 


hence, 

\\kx\\ < — (||fcr77./|| + ||fcm/||) + a 0 c||e|| + a 0 c\\en s mf\\ (6.5.17) 

oc 0 

Similarly, for the gradient or least-squares algorithms, we have 

x = em 2 — k(j) (6.5.18) 

8 + «m 

obtained by using the equation 

kx = kcj) kef) 

s + dm s + a m 

that follows from Swapping Lemma A.l together with the equation for em 2 
in (6.5.6). Equation (6.5.18) implies that 
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Because n 2 = </> 2 and cj> = x, we have \4>(t)\ < c\\x\\ which implies that 
G Coo and, therefore, 

171 f 

|| a: || < ||e|| + ||en s m/|| + c||fem/|| 

Substituting for ||,t|| in (6.5.16), we obtain the same expression for ||fcx|| as 
in (6.5.17). 


Step 3. Use the B-G Lemma to establish boundedness. From (6.5.14) 
and (6.5.17), we obtain 

m 2 < 1 + OqC + -^r(||fcm/|| 2 + ||fcm/|| 2 ) + cao||en s m/|| 2 (6.5.19) 
Q o 

by using the fact that e G Coo f)-^ 2 - We can express (6.5.19) as 

m 2 < 1 + «oC+ ||m/|| 2 + cal\\gmf \\ 2 (6.5.20) 

a 0 


where g 2 = \en s \ 2 + Because the adaptive laws guarantee that en s , k G 
a o 

£2 it follows that g G £ 2 - Using the definition of the C 25 norm, inequality 
(6.5.20) may be rewritten as 


m 2 < 1 + cal + c J e 5 ^ ^ ^ alg 2 (r ) 
Applying the B-G Lemma III, we obtain 



m 2 (r)dT 


m 2 <(l + cal)e s ^ r i<h(t, to) + (1 + cafyd e T ^(t,r)dT 


where 


$(t,r) = e ^ (i T) e c />oS 2 W^ 


Choosing «o so that -% < f, «o > 2 a m and using g G £ 2 , it follows that 
a o 

m,f G Coo- Because mf bounds x,x,u from above, it follows that all signals 
in the closed-loop adaptive system are bounded. 


Step 4. Establish convergence of the regulation error to zero. For the 
adaptive law (6.5.5), it follows from (6.5.9), (6.5.10) that x G £2 and from 
(6.5.8) that x G Coo- Hence, using Lemma 3.2.5, we have x(t) — > 0 as t — > 00 . 
For the adaptive law (6.5.6) or (6.5.7) we have from (6.5.18) that x G £2 
and from (6.5.8) that x G Coo , hence, x(t) — > 0 as t — ► 00 . 
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6.5.2 Example: Adaptive Tracking 

Let us consider the tracking problem defined in Section 6.2.2 for the first 
order plant 

x = ax + bu (6.5.21) 

where a, b are unknown (with 6^0). The control law 

u = —k*x + Tr (6.5.22) 

where 

k * = c hn + a , r = brn ( 6 . 5 . 23 ) 

guarantees that all signals in the closed-loop plant are bounded and the plant 
state x converges exponentially to the state x m of the reference model 

x m = -hp— r (6.5.24) 

Because a, b are unknown, we replace (6.5.22) with 

u = —k(t)x + l(t)r (6.5.25) 

where k(t),l(t) are the on-line estimates of k*,l*, respectively. We design 
the adaptive laws for updating k(t), l(t) by first developing appropriate 
parametric models for k*,l* of the form studied in Chapter 4. We then 
choose the adaptive laws from Tables 4.1 to 4.5 of Chapter 4 based on the 
parametric model satisfied by k * , l* . 

As in Section 6.2.2, if we add and subtract the desired input —bk*x + bl*r 
in the plant equation (6.5.21) and use (6.5.23) to eliminate the unknown a, 
we obtain 

x = —a m x + b m r + b(u + k* x — Tr) 
which together with (6.5.24) and the definition of ei = x — x m give 

ei = — - — (u + k*x-l*r) (6.5.26) 


Equation (6.5.26) can also be rewritten as 

e\ = b(9* T (j) + uj ) 


(6.5.27) 
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where 0* = [k*,l*],<f> = ^T—[x,—r] T ,Uf = s+ 1 a u. Both equations are in 
the form of the parametric models given in Table 4.4 of Chapter 4. We can 
use them to choose any adaptive law from Table 4.4. As an example, let us 
choose the gradient algorithm listed in Table 4.4(D) that does not require 
the knowledge of sign b. We have 


k 

i 

b 


N(w)'yi€(j)i 

N(w)^4>2 

N(w)'yef 


N(w) = w 2 cos re, w = wq + 


2 7 


w o 
e 

£ 

4>i = 
m 2 = 


e z mr , ico(0) = 0 
ei - ei 


rrr 


e\ = N(w)b £ 


= k(j)i + l(j)2 + Uf, Uf = 


1 


-X, (j) 2 = -- 


1 


s + a m s + a m 

1 + n si n s = dl + 02 + 

7i> 72) 7 > 0 


(6.5.28) 


As shown in Chapter 4, the above adaptive law guarantees that k, l, w, 
w 0 € Too and e,en s ,k,l,b 6 £ooD -^2 independent of the boundedness of 

Despite the complexity of the adaptive law (6.5.28), the stability anal- 
ysis of the closed-loop adaptive system described by the equations (6.5.21), 
(6.5.25), (6.5.28) is not more complicated than that of any other adaptive 
law from Table 4.4. We carry out the stability proof by using the properties 
of the /125-norm and B-G Lemma in a similar way as in Section 6.5.1. 

Step 1. Express the plant output x and input u in terms of the parameter 
errors k,l. From (6.5.24), (6.5.25) and (6.5.26) we have 

x = x m (kx~ Ir ) = ( b m r + blr — bkx) (6.5.29) 

s + a m ' ' s + a m ' ' 
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and from (6.5.21), (6.5.29) 

u = — — — - — — — x = b m r + blr — bkx (6.5.30) 

b b(s + a m ) L 

For simplicity, let us denote || (•)* II 25 by li ' II- Again for the sake of clarity 
and ease of exposition, let us also denote any positive finite constant whose 
actual value does not affect stability with the same symbol c. Using the 
properties of the C 2 S~norm in (6.5.29), (6.5.30) and the fact that r,l £ Coo 
we have 

INI < c + c\\kx\\, )|u|| < c + c\\kx\\ 

for any 5 £ [0, 2a m ), which imply that the fictitious normalizing signal de- 
fined as 

2 A 1 1 11 11 2 , n 11 2 

m f = 1 + ||x|| + \\u\\ 

satisfies 

m 2 < c + c\\kx \\ 2 (6.5.31) 

We verify, using the boundedness of r,l,k, that <f>i/mf, x/m,f, n s /rrif £ Coo 
as follows: From the definition of (j) 1 , we have |</>i(t)| < c||x|| < crrif. Simi- 
larly, from (6.5.29) and the boundedness of r,l, k, we have 

|x(f)| < c + c||x|| < c + crrif 

Because x = —a m x + b m r + blr — bkx, it follows that |x| < c + cm/. Next, 
let us consider the signal n 2 s = 1 + (j ) 2 + <\>\ + u 2 . Because \uf\ < c||u|| < cm/ 
and An, fa £ Coo 1 it follows that n s < crrif. 

rrif j 

Step 2. Use the Swapping Lemma and properties of the C 26 norm to 
upper bound ||fcx|| with terms that are guaranteed by the adaptive law to have 
finite C 2 - gains. We start with the identity 

kx = (l — — ^ kx -\ — — kx = — - — (kx + k±) H — — kx 

(6.5.32) 

where ckq > 0 is an arbitrary constant. From (6.5.29) we also have that 


kx 


( s + a m ) 

b 


ei + Ir 
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where e\ = x — x m , which we substitute in the second term of the right-hand 
side of (6.5.32) to obtain 


kx = 


1 


S + «0 

Because k,l,r £ C 0 0 we have 


> i ; • 1 ('S A Clm) OiQ ~ 

kx + kx | — * r-ei H lr 

b (s + a 0 ) s + a 0 


||fea;|| < — ||£ix| 
«o 


«0 


a 0 c||ei| 


(6.5.33) 


for any 0 < 5 < 2 a m < «o- 

As in Section 6.5.1, the gain of the first two terms on the right-hand side 
of (6.5.33) can be reduced by choosing «o large. So the only term that needs 
further examination is cko c II e i II • The tracking error e\, however, is related 
to the normalized estimation error e through the equation 


ei = em 2 + N(w)b£ = e + en 2 s + N{w)b £ 


that follows from (6.5.28). 

Because e,en s £ jCoof)-^ an d N(w)b £ Coo, the signal we need to con- 
centrate on is £ which is given by 

£ = k(j ) i + l(j > 2 -I — u 

s + a m 


We consider the equation 


u = (— kx + lr) = —k(j ) i — l(p 2 H 

s + a m s + d m s + d r 


(k(j>\ + lr 


obtained by using the Swapping Lemma A.l or the equation 

(s + d m )(k(j) i + l(j> 2) = kx - lr + (kepi + Z0 2 ) = u + k(j>i + 


Using any one of the above equations to substitute for k\4> 1 + Z </> 2 in the 
equation for £ we obtain 

£ = — ^ — (A:0i + ifc) 

S + d m 

hence 

ei = e + en^ + N(w)b + l(f> 2 ) (6.5.34) 
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Because e, N(w), b, l, fo, r G T 0 0 it follows from (6.5.34) that 

1 1 ei 1 1 < c+ ||en 2 || + c||fc(£i|| (6.5.35) 

and, therefore, (6.5.33) and (6.5.35) imply that 

\\kx\\ < c + ca o H ||fcx|| H — -||x|| + aoc||era 2 || + ctoc|| || (6.5.36) 

Step 3. Use the B-G Lemma to establish boundedness. Using (6.5.36) 
and the normalizing properties of mf, we can write (6.5.31) in the form 



< c+ cal + ca o\\9 m f\\ 2 + “rll m /l| 2 (6.5.37) 

«o 


where g 2 = -\|A;| 2 + |e 2 n 2 | + \k\. Inequality (6.5.37) has exactly the same 
a o 

form and properties as inequality (6.5.20) in Section 6.5.1. Therefore the 
boundedness of mj follows by applying the B-G Lemma and choosing ag > 
max{4a, 2 n , y} as in the example of Section 6.5.1. 

From mf G Too we have x,x,n s ,(j>i G Too, which imply that u and all 
signals in the closed-loop plant are bounded. 

Step 4. Establish convergence of the tracking error to zero. We show the 
convergence of the tracking error to zero by using (6.5.34). From e, en s , k, l G 
T 2 and n s , N(w)b , <f>i, 4>2 £ Too we can establish, using (6.5.34), that ei G T 2 
which together with e\ = x — x m G Too imply (see Lemma 3.2.5) that 
ei(t) 0 as t — > 00 . 

6.5.3 MRAC for SISO Plants 

In this section we extend the design approach and analysis used in the ex- 
amples of Sections 6.5.1 and 6.5.2 to the general SISO plant (6.3.1). We con- 
sider the same control objective as in Section 6.3.1 where the plant (6.3.1) 
and reference model (6.3.4) satisfy assumptions PI to P4, and Ml and M2, 
respectively. 

The design of MRAC schemes for the plant (6.3.1) with unknown pa- 
rameters is based on the certainty equivalence approach and is conceptually 
simple. With this approach, we develop a wide class of MRAC schemes by 
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combining the MRC law (6.3.22), where 9* is replaced by its estimate 9(t), 
with different adaptive laws for generating 9(t ) on-line. We design the adap- 
tive laws by first developing appropriate parametric models for 6* which we 
then use to pick up the adaptive law of our choice from Tables 4.1 to 4.5 in 
Chapter 4. 

Let us start with the control law 

Up = 9j ( t ) Up P 9j ( t ) + #3 (t)y p P c 0 {t)r (6.5.38) 

whose state-space realization is given by 

tu\ = Fu>i +gu pi wi(0) = 0 

(I>2 = Flo 2 + gyp, ^(0) = 0 (6.5.39) 

Up = 6 t uj 

where 9 = [9j , 9j , 9^, co] T and lo = [uj ,uj ,y p ,r] T , and search for an adap- 
tive law to generate 6{t), the estimate of the desired parameter vector 9* . 
In Section 6.4.1 we develop the bilinear parametric model 

ei = W m (s)p* [, Up - 9 * t lo } (6.5.40) 

where p* = ^*,9* = , 9% T , 9 3 , cj$] T by adding and subtracting the desired 

control input 9* t lo in the overall representation of the plant and controller 
states (see (6.4.6)). The same parametric model may be developed by using 
the matching equation (6.3.13) to substitute for the unknown plant polyno- 
mial Rp(s) in the plant equation and by cancelling the Hurwitz polynomial 
Zp(s) . The parametric model (6.5.40) holds for any relative degree of the 
plant transfer function. 

A linear parametric model for 9* may be developed from (6.5.40) as 
follows: Because p* = -^ and 9* t lo = 9q T loo Pc^r where 9 [ j = [#( T , 9i, T , #3] T 
and loq = [t^, loJ, Vp] T , we rewrite (6.5.40) as 

W m (s)u p = c* 0 e 1 + W m (s)9* 0 T io 0 P c* 0 W m (s)r 

Substituting for e\ = y p — y m and using y m = W m (s)r we obtain 

W m (s)u p = c* 0 y p P W m (s)9* 0 T lo 0 
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which may be written as 

z = 6* T (j) p (6.5.41) 

where 


^ = W m (s)u p 

4> P = [W m (s)uJ ,W m (s)uJ , W m (s)y p ,y p \ T 
0 * = [Of ,6f ,0* 3 ,c* o f 

In view of (6.5.40), (6.5.41) we can now develop a wide class of MRAC 
schemes by using Tables 4.1 to 4.5 to choose an adaptive law for 6 based on 
the bilinear parametric model (6.5.40) or the linear one (6.5.41). 

Before we do that, let us compare the two parametric models (6.5.40), 

(6.5.41) . The adaptive laws based on (6.5.40) listed in Table 4.4 of Chap- 
ter 4 generate estimates for Cq as well as for p* = 4r. In addition, some 
algorithms require the knowledge of the sgn(p*) and of a lower bound for 
\p* | . On the other hand the adaptive laws based on the linear model (6.5.41) 
generate estimates of Cg only, without any knowledge of the sgn(p*) or lower 
bound for |p*|. This suggests that (6.5.41) is a more appropriate parameter- 
ization of the plant than (6.5.40). It turns out, however, that in the stability 
analysis of the MRAC schemes whose adaptive laws are based on (6.5.41), 
l/co(t) is required to be bounded. This can be guaranteed by modifying 
the adaptive laws for co(t) using projection so that |co(t)| > c 0 > 0,Vt > 0 
for some constant c 0 < |cg| = |^pj. Such a projection algorithm requires 

the knowledge of the sgn(cg) = sgn and the lower bound c 0 , which is 

calculated from the knowledge of an upper bound for \k p \. Consequently, as 
far as a priori knowledge is concerned, (6.5.41) does not provide any spe- 
cial advantages over (6.5.40). In the following we modify all the adaptive 
laws that are based on (6.5.41) using the gradient projection method so that 
|cg(f)| > c 0 > 0 Vt > 0. 

The main equations of several MRAC schemes formed by combining 
(6.5.39) with an adaptive law from Tables 4.1 to 4.5 based on (6.5.40) or 

(6.5.41) are listed in Tables 6.4 to 6.7. 

The basic block diagram of the MRAC schemes described in Table 6.4 is 
shown in Figure 6.13. An equivalent representation that is useful for analysis 
is shown in Figure 6.14. 
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Table 6.4 MRAC schemes 


Plant 

n, — u z pO v 
yp — K pr p (s ) a p 

Reference model 

Urn = = W m ( S )r 

Control law 

6j\ = Foj\ + gu p , uq(0) = 0 
t 02 = Fuj 2 + gy p , cu 2 (0) = 0 

Up = 9 t lo 

0 = [9j ,9j ,9 3 ,c 0 \ T = [u>J ,y p ,r] T 

UiSK”- 1 , i = 1,2 

Adaptive law 

Any adaptive law from Tables 6.5, 6.6 

Assumptions 

Plant and reference model satisfy assumptions PI 
to P4 and Ml, M2 respectively. 

Design variables 

F,g chosen so that (si — F)~ 1 g = where 

a(s) = [s n-2 , s n-3 , . . . , s, 1] T for n > 2 and a(s) = 

0 for n = 1; A(s) = s" - ^ 1 + A„,_ 2 s n ~ 2 + • • • + Ao is 
Hurwitz. 


Figure 6.14 is obtained by rewriting u p as 

Up = 9 * t u> + 9 t lo 


where 9 = 9 — 9*, and by using the results of Section 6.4.1, in particular, 
equation (6.4.6) to absorb the term 9* t iu. 

For 9 = 9 — 9* = 0, the closed-loop MRAC scheme shown in Figure 6.14 
reverts to the one in the known parameter case shown in Figure 6.5. For 
0 / 0, the stability of the closed-loop MRAC scheme depends very much on 

the properties of the input \9 t lu, which, in turn, depend on the properties 

c o 

of the adaptive law that generates the trajectory 9(t ) = 9{t) — 9*. 
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Table 6.5 Adaptive laws based on ei = W m {s)p*(u p - d* T u ;) 


A. Based on the SPR-Lyapunov approach 

Parametric 

model 

ei = W m (s)L(s)[p*(u f - d* T (/))] 

Adaptive law 

6 = -Te<j) sgn(k p /k m ) 

P = 7 e Ci e = ei-ei -W m (s)L(s)(en 2 ) 
ei = W m (s)L(s)[p(uf - 0 T cj))\ 

£ = Uf — 9 t 4>, 4> = L^ 1 (s)ui 

Uf = L^ 1 (s)u p , n 2 = (j) T 4> + u 2 f 

Assumptions 

Sign (k p ) is known 

Design 

variables 

T = r T > 0,7 > 0 ; W m (s)L(s) is proper and SPR; 
L^ 1 (s) is proper and has stable poles 

B. Gradient algorithm with known sgn (k p ) 

Parametric 

model 

ei = p*(uf - 0* T (j)) 

Adaptive law 

0 = -Tecj) sgn (k p /k m ) 

P = 

^ — ex-ex 
e ~ 

ei = p(uf - 9 t 4>) 

(/> = W m (s)uj, Uf = W m (s)u p 

£ = Uf — 6 T 4> 

m 2 = 1 + c/> T 0 + u 2 

Design 

variables 

r = r T > 0,7 > o 
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Table 6.5 (Continued) 


C. Gradient algorithm with unknown sgn (k p ) 

Parametric 

model 

ei =p*(u f -6* T f>) 

Adaptive law 

6 = —N(xo)Te(j) 
p = N(x 0 )je£ 

N(x 0 ) = Xg cosxo 

x 0 = W 0 + w 0 = e 2 m 2 , tc 0 (0) = 0 

e = = N(x 0 )p{u f 6 T <j>) 

<t> = W m (s)uj, u f = W m (s)up 
£ = up — 6 T (f>, m 2 = 1 + n 2 ,n 2 = (f T <f> + u 2 

Design 

variables 

r = r T > 0 , 7 > 0 


The following theorem gives the stability properties of the MRAC scheme 
shown in Figures 6.13 and 14 when the adaptive laws in Tables 6.5 and 6.6 
are used to update Oft ) on-line. 

Theorem 6.5.1 The closed-loop MRAC scheme shown in Figure 6.13 and 
described in Table 6.4 with any adaptive law from Tables 6.5 and 6.6 has the 
following properties: 

(i) All signals are uniformly bounded. 

(ii) The tracking error e\ = y v — y m converges to zero as t. — > 00. 

(iii) If the reference input signal r is sufficiently rich of order 2 n, r £ 
and R p , Z p are coprime, the tracking error e\ and parameter error 
6 = 6 — 0* converge to zero for the adaptive law with known sgn(k p ) . 
The convergence is asymptotic in the case of the adaptive law of Table 
6.5(A, B) and exponential in the case of the adaptive law of Table 6.6. 
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Table 6.6 Adaptive laws based on W m {s)u p = d* T <f> p 


Constraint 

g{9) = c 0 - c 0 sgnc 0 < 0 

0=[9j,d 2 t ,03,c o] t 

Projection 

operator 

|Tx if co(t)| > c 0 or 

P r [rx]=J ifjco(t) =c Q and (rx) T Vs- < 0 

[rx-T vffvqPx otherwise 

A. Gradient Algorithm 

Adaptive law 

9 = Pr[Te(j) p ] 

Design variable 

r = r T > 0 

B. Integral gradient a Algorithm 

Adaptive law 

0 = Pr[-T(R9 + Q)\ 

R= /LR+^F, 12(0) = 0 

0—HQ-gz, 0(0) Lo 

2 = W m (s)u p 

Design 

variable 

r = r T > o,/3 > o 

C. Least-squares with covariance resetting 

Adaptive law 

0 = Pr[Pe(j) p \ 

. \ if |co(t)| > c 0 or 

P =S if |c 0 (t)| = c 0 and ( Pe(j) p ) T y g < 0 

(0 otherwise 

P(t + ) = Po = Pol 

Design 

variable 

P( 0 ) = P 1 (0) > 0, t r is the time for which 
^min(P(t)') — Pit Pi ^ P0 ^ 0 

Common signals and variables 

e - m{ J- 2 P ,z- 0 T 4 > p ,m 2 - 1 +(f>J(f> p 

4> p = [W m (s)ujJ 1 W m (s)ujJ, W m (s)y p , y p ] T 

|co(0)| > c o ,0 < c 0 < Cq , sgn(c 0 (0)) = sgn {k p /k m ) 
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Table 6.7 Hybrid MR AC 


Parametric 

model 

W m (s)u p = 9* T (j) p 

Control law 

u>i = Fuji + gu p 

lo 2 = Fuj 2 + gy P 

u p (t) = 0ju;(t), te(t k ,tk+ 1 ] 

Hybrid 
adaptive law 

9k+\ = Pf {^fc+r// fc fc+ 1 e(r )(f> p (T)dr j ; k=0, 1 , 2 ,... 

e= m( J 2 p , z(t) = 9j(j) p , te(t k ,t k+ 1 ] 
m 2 = 1 + (34>J 4> p 

where Pr[-\ is the discrete-time projection 

p r f x \ ^{ x if l x °l ^ ^ 

! \ x + ^(-0 _ • T o)sgn(c 0 ) otherwise 

where xo is the last element of the vector x; 70 is 
the last column of the matrix T, and 700 is the last 
element of 70 

Design 

variables 

P > i; t k = kT s , t s > 0; r = r T > o 

2 - T s \ rnax (T) > 7 for some 7 > 0 


Outline of Proof The proof follows the same procedure as that for the ex- 
amples presented in Sections 6.5.1, 6.5.2. It is completed in five steps. 

Step 1. Express the plant input and output in terms of the adaptation error 
9 T lu. Using Figure 6.14 we can verify that the transfer function between the input 
r + \9' u> and the plant output y„ is given by 

c o 

Up = G c (s ) (r+ \d T u 
\ c o 


where 


G c (s) = 


c 0 kpZ p 


(1 - dl T {sI - F)~ 1 g)R p - k p Z p [9* 2 T (sI - F)~ 1 g + 0* 3 ] 


Because of the matching equations (6.3.12) and (6.3.13), and the fact that (si - 
F)~ 1 g = we have, after cancellation of all the stable common zeros and poles 
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Figure 6.13 Block diagram of direct MRAC with normalized adaptive 
law. 

in G c (s), that G c (s ) = W m (s). Therefore, the plant output may be written as 

y P = W m {s) (r + ^9 T (6.5.42) 

Because y p = G p (s)u p and G~ 1 (s) has stable poles, we have 

u p = G~ 1 (s)W m (s) ^r+ (6.5.43) 

where G~ 1 (s)W m (s) is biproper Because of Assumption M2. 

We now define the fictitious normalizing signal mf as 

TO / = 1 + IKH 2 + IMI 2 (6.5.44) 

where || • || denotes the ^ 25 -norm for some S > 0. Using the properties of the £ 2<5 
norm it follows that 

m.f < c+ c\\d T u\\ (6.5.45) 
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Figure 6.14 Equivalent representation of the MRAC scheme of Table 6.4. 


where c is used to denote any finite constant and S > 0 is such that W m (s — 
|),G“ 1 (s — |) have stable poles. Furthermore, for 9 € (guaranteed by the 
adaptive law), the signal nif bounds most of the signals and their derivatives from 
above. 

Step 2. Use the Swapping Lemmas and properties of the C 25 norm to upper 
bound \\9 T u\\ with terms that are guaranteed by the adaptive law to have finite 
C 2 gains. This is the most complicated step and it involves the use of Swapping 
Lemmas A.l and A. 2 to obtain the inequality 

ll^ll < —mf + caf\\gmf\\ (6.5.46) 

«o 


where g 2 = e 2 n 2 + \Q \ 2 + e 2 and g is guaranteed by the adaptive law to belong to 
£2 and «o > 0 is an arbitrary constant to be chosen. 

Step 3. Use the B-G Lemma to establish boundedness. From (6.5.45) and 
(6.5.46), it follows that 


mj < c + A^m 2 + cal n ||gm/|| 2 
a b 


(6.5.47) 


or 


m 2 < c + c 


«o n * 9 2 (T)m 2 f (T)dT 


0 
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for any «o > ccq and some «g > 0. 

Applying the B-G Lemma and using g £ £2, the boundedness of m/ follows. 
Using m.f £ C^, we establish the boundedness of all the signals in the closed-loop 
plant. 

Step 4 . Show that the tracking error converge to zero. The convergence of e\ 
to zero is established by showing that e\ £ £2 and ei £ and using Lemma 3.2.5. 


Step 5 . Establish that the parameter error converges to zero. The convergence 
of the estimated parameters to their true values is established by first showing that 
the signal vector <f> or <f> p , which drives the adaptive law under consideration, can 
be expressed as 

(f> or 4> p = H{s)r + <f> 

where H(s) is a stable transfer matrix and (f> £ £ 2 . If r is sufficiently rich of 
order 2 n and Z p , R p are coprime then it follows from the results of Section 6.4 that 
(f> m = H(s)r is PE, which implies that (f> or </> p is PE. The PE property of f> p or 
guarantees that 6 and e\ converge to zero as shown in Chapter 4. 

A detailed proof of Theorem 6.5.1 is given in Section 6.8. □ 

The MRAC scheme of Table 6.4 with any adaptive law from Table 6.5 
guarantees that £ £ £2 which together with e £ £2 implies that p £ C 1 , i.e. , 

£ \p\dr < 7 (^°° e 2 dr) 5 ( jT edr^j 5 < 00 

which, in turn, implies that p(t) converges to a constant as t — > 00 indepen- 
dent of the richness of r. 

The stability properties of the hybrid MRAC scheme of Table 6.7 are 
similar to the continuous-time MRAC schemes and are summarized by the 
following Theorem. 

Theorem 6 . 5.2 The closed-loop system obtained by applying the hybrid 
MRAC scheme of Table 6.7 to the plant given in Table 6.4 has the following 
properties: 

(i) All signals are bounded. 

(ii) The tracking error e\ converges to zero as t — > 00. 

(iii) If r is sufficiently rich of order 2 n and Z p , R p are coprime, then the 
parameter error = 0 k — 6* converges exponentially fast to zero as k, 
t — > 00. 
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Proof First, we show that the projection algorithm used in Table 6.7 guarantees 
|c 0 fc| > c 0 Vk > 0 without affecting (i) to (iii) of the hybrid adaptive law established 
in Theorem 4.6.1. We rewrite the adaptive law of Table 6.7 as 


9k+ 1 

= 

0 p k+1 + Ap 

°l + 1 

A 

rtk+ 

9 k + r / 



Jtk 

Ap 

A 

S 0 


1 



if l C 0(k+l)l — -0 

otherwise 


(6.5.48) 


where c^ fe+1 ^ is the last element of the vector 9 k+1 . We can view 9 k+1 , 9 k +i as 
the pre- and post-projection estimate of 9* respectively. It is obvious, from the 
definition of 70,700 that c 0 ( fc+ i) = c 0 sgn(c 0 ) if |cg (fe+1) | < c 0 and c 0 ( fc+ i) = c£ (fc+1) 
if |cg (fc+1) | > c 0 . Therefore, the constraint cofc > c 0 is satisfied for all k. 

Now consider the same Lyapunov function used in proving Theorem 4.6.1, i.e. , 


v(k) = 9^ r -% 


we have 


V(k + 1) = 9ll 1 T- 1 9l +1 + 29l T +1 T- 1 Ap + (Ap) T r- 1 Ap (6.5.49) 

In the proof of Theorem 4.6.1, we have shown that the first term in (6.5.49) satisfies 


npT r -12p 
°k+ 1 1 U k+ 1 


< V(k) - (2 - T s X m ) 



e 2 (r)?n 2 (r)(ir 


(6.5.50) 


For simplicity, let us consider the case sgn(co) = 1. Exactly the same analysis can 
be carried out when sgn(co) = —1. Using r _1 r = / and the definition of 70,700, 
we have 

' 0 ' 

0 


F-i 


7o = 


0 

1 


Therefore, for c^^ k+1 ^ < c 0 , the last two terms in (6.5.49) 


can be expressed as 


29ll 1 T- 1 Ap+(Ap) T T- 1 Ap 


(^(fe+l) C o)(£o Co) + (Co Co(A,_|_ l)) 2 

7oo v ’ 7oo ’ 

(c 0 — c 0 )(c 0 + — 2c 0 ) < 0 
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where the last inequality follows because Cq > c 0 > c(^ fc+1 ,. For cL fc+1 ^ > c 0 , we 
have 2^ i r~ 1 Ap+ (Ap) T T _1 Ap = 0. Hence, 

20fe+ i r” 1 Ap+(Ap) T r- 1 Ap< 0, Vfc>0 (6.5.51) 

Combining (6.5.49)-(6.5.51), we have 

A ftk+1 

A V(k) = V(k + 1) - V’(jfe) < -(2 - T s \ m ) / e 2 (r)m 2 (r)dr 

Jt k 

which is similar to (4.6.9) established for the hybrid adaptive law without projection. 
Therefore, the projection does not affect the ideal asymptotic properties of the 
hybrid adaptive law. 

The rest of the stability proof is similar to that of Theorem 6.5.1 and is briefly 
outlined as follows: Noting that 9{t) = 9k, it £ [kT s , (k + 1 )T S ) is a piecewise con- 
stant function with discontinuities at t = kT s , k = 0,1,..., and not differentiable, 
we make the following change in the proof to accommodate the discontinuity in 
9k- We write 9(t) = 9(t) + ( 9(t ) — 0(f)) where 9(t) is obtained by linearly inter- 
polating 9k,9k + i on the interval [kT s , (k + 1)T S ) . Because 9k € C oa ,A9k £ C 2 , 
we can show that 9 has the following properties: (i) 9 is continuous, (ii) 9(t) = 
9 k + 1 - Skit £ [ kT s , (k + 1 )T S ), and 9 £ f| £2, (hi) \9(t) - 9(t)\ < \9 k +i - 9 k \ 
and 1 9{t) — 9{t) \ £ £00 0^2- Therefore, we can use 9 in the place of 9 and the 
error resulting from this substitution is (9 — 9) t lu , which has an £2 gain because 
\9 — 9\ £ £2. The rest of the proof is then the same as that for the continuous 
scheme except that an additional term (9 — 9) T u> appears in the equations. This 
term, however, doesnot affect the stability analysis since it has an £2 gain. □ 


Remark 6.5.1 In the analysis of the MRAC schemes in this section we 
assume that the nonlinear differential equations describing the stability 
properties of the schemes possess a unique solution. This assumption 
is essential for the validity of our analysis. We can establish that these 
differential equations do possess a unique solution by using the results 
on existence and uniqueness of solutions of adaptive systems given in 
[191]. 

6.5.4 Effect of Initial Conditions 

The analysis of the direct MRAC schemes with normalized adaptive laws 
presented in the previous sections is based on the assumption that the ini- 
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tial conditions for the plant and reference model are equal to zero. This 
assumption allows us to use transfer function representations and other I/O 
tools that help improve the clarity of presentation. 

Nonzero initial conditions introduce exponentially decaying to zero terms 
in the parametric models (6.5.40) and (6.5.41) as follows: 

ei = W m (s)p*[u p -6* T u \ + e t 
z = r T 0 p + e t (6.5.52) 

where et = Cj (si — A c ) _ 1 e(0) and A c ,e(0) are as defined in Section 6.4. 
Because A c is a stable matrix, the properties of the adaptive laws based on 
(6.5.52) with et = 0 remain unchanged when et 7 ^ 0 as established in Section 
4.3.7. Similarly, the et terms also appear in (6.5.42) and (6.5.43) as follows: 

Up = W m (s)(r + \o t lo) + e t 
c o 

u p = Gp 1 (s)W m (s)(r + —6 T u) + e t (6.5.53) 

c o 

where in this case et denotes exponentially decaying to zero terms because of 
nonzero initial conditions. The exponentially decaying term only contributes 
to the constant in the inequality 

mj < c + e t + -^||m/|| 2 + aft 1 * c\\gm f \\ 2 
«o 

where g G C- 2 - Applying the B-G Lemma to the above inequality we can 
establish, as in the case of e t = 0, that mj £ C 0 0 . Using rrif £ Coo, the 
rest of the analysis follows using the same arguments as in the zero initial 
condition case. 


6.6 Indirect MRAC 

In the previous sections, we used the direct approach to develop stable 
MRAC schemes for controlling a wide class of plants with unknown parame- 
ters. The assumption on the plant and the special form of the controller en- 
abled us to obtain appropriate parameterizations for the unknown controller 
vector 8* that in turn allows us to develop adaptive laws for estimating the 
controller parameter vector 6(t) directly. 
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An alternative way of controlling the same class of plants is to use the 
indirect approach, where the high frequency gain k p and coefficients of the 
plant polynomials Z p (s), R p (s) are estimated and the estimates are, in turn, 
used to determine the controller parameter vector 9{t) at each time t. The 
MRAC schemes based on this approach are referred to as indirect MRAC 
schemes since 6(t) is estimated indirectly using the plant parameter esti- 
mates. 

The block diagram of an indirect MRAC scheme is shown in Figure 6.15. 
The coefficients of the plant polynomials Z p (s),R p (s) and high frequency 
gain k p are represented by the vector 9*. The on-line estimate 9 p (t) of 0*, 
generated by an adaptive law, is used to calculate the controller parameter 
vector 9(t) at each time t using the same mapping / : 6 P i— ► 9 as the mapping 
/ : 9* i— ► 9* defined by the matching equations (6.3.12), (6.3.16), (6.3.17). 
The adaptive law generating 9 P may share the same filtered values of u p ,y p , 
i.e. , u i , uj ‘2 as the control law leading to some further interconnections not 
shown in Figure 6.15. 

In the following sections we develop a wide class of indirect MRAC 
schemes that are based on the same assumptions and have the same sta- 
bility properties as their counterparts direct MRAC schemes developed in 
Sections 6.4 and 6.5. 

6.6.1 Scalar Example 

Let us consider the plant 

x = ax + bu (6.6.1) 

where a, b are unknown constants and sgn(6) is known. It is desired to choose 
u such that all signals in the closed-loop plant are bounded and the plant 
state x tracks the state x m of the reference model 

X m = U j mNm, A b m v (6.6.2) 

where a m >0, b m and the reference input signal r are chosen so that x m (t) 
represents the desired state response of the plant. 

Control Law As in Section 6.2.2, if the plant parameters a, b were known, 
the control law 


u = —k*x + Tr 


(6.6.3) 
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Figure 6.15 Block diagram of an indirect MRAC scheme. 


; * b m 

k 1 ~ T 


(6.6.4) 


could be used to meet the control objective. In the unknown parameter case, 


we propose 


u = —k{t)x + l{t)r 


(6.6.5) 


where k(t),l(t ) are the on-line estimates of k*,l* at time t, respectively. In 
direct adaptive control, k(t),l(t) are generated directly by an adaptive law. 
In indirect adaptive control, we follow a different approach. We evaluate 
k(t),l(t) by using the relationship (6.6.4) and the estimates d, b of the un- 
known parameters a, b as follows: 


k(t) 


dm + d(t) 


b{t) 



( 6 . 6 . 6 ) 
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where a, b are generated by an adaptive law that we design. 

Adaptive Law The adaptive law for generating a, b is obtained by following 
the same procedure as in the identification examples of Chapter 4, i.e., we 
rewrite (6.6.1) as 

x = [(a + a m )x + bu\ 

S + dm 

and generate x, the estimate of x. from 

x = — 1 — [(a + a m )x + bu\ = x m (6.6.7) 

S T CLm 

where the last equality is obtained by using (6.6.5), (6.6.6). As in Section 
6.2.2, the estimation error e± = x — x m = e\ is the same as the tracking error 
and satisfies the differential equation 

e\ = — o m ei — ax — bu (6.6.8) 


where 


- A A 
a = a — a, 



b 


are the parameter errors. Equation (6.6.8) motivates the choice of 


V = 


1 

2 



+ 



(6.6.9) 


for some 71,72 > 0, as a potential Lyapunov-like function candidate for 
(6.6.8). The time derivative of V along any trajectory of (6.6.8) is given by 


Hence, for 


1 — d m 


2 7 aa bb 

e 1 — axe 1 — bue 1 -| 1 

7i 72 


a = a = yieix, b = b = 


( 6 . 6 . 10 ) 


(6.6.11) 


we have 

V = — a m ei < 0 

which implies that ei,a,fe G and that ei G £2 by following the usual 
arguments. Furthermore, x m , e 1 G £00 imply that x G £00 • The bound- 
edness of u, however, cannot be established unless we show that k(t),l(t) 



400 


CHAPTER 6. MODEL REFERENCE ADAPTIVE CONTROL 


are bounded. The boundedness of | and therefore of k(t),l(t ) cannot be 
guaranteed by the adaptive law (6.6.11) because (6.6.11) may generate esti- 
mates 6(i ) arbitrarily close or even equal to zero. The requirement that 6(t ) 
is bounded away from zero is a controllability condition for the estimated 
plant that the control law (6.6.5) is designed for. One method for avoid- 
ing b(t) going through zero is to modify the adaptive law for 6(i) so that 
adaptation takes place in a closed subset of 1Z 1 which doesnot include the 
zero element. Such a modification is achieved by using the following a priori 
knowledge: 

The sgn(6) and a lower bound bo > 0 for |6| is known (A2) 

Applying the projection method with the constraint b sgn (6) > bo to the 
adaptive law (6.6.11), we obtain 

( if |6| > bo or 

a = 7 ieix, 6 = < 1 if |6| = bo and e\ u sgn(6) > 0 (6.6.12) 

I 0 otherwise 


where 6(0) is chosen so that 6(0)sgn(6) > bo- 

Analysis It follows from (6.6.12) that if 5(0)sgn(6) > 6o, then whenever 
6(t)sgn(6) = | b(t) | becomes equal to bo we have 66 > 0 which implies that 
|6(t)| > bo, Vf > 0. Furthermore the time derivative of (6.6.9) along the 
trajectory of (6.6.8), (6.6.12) satisfies 


V = 


—a m e\ if |6| > bo or |6| = bo and e\ u sgn(6) > 0 

— a m e\ — be\u if |6| = bo and e\ u sgn(6) < 0 


Now for |6| = bo, we have (6 — 6)sgn(6) < 0. Therefore, for |6| = bo and 
e\u sgn(6) < 0, we have 

be\u = (6 — b)e\u = (6 — 6)sgn(6)(ei«sgn(6)) > 0 


which implies that 

V < —a m ef <0, Vt > 0 


Therefore, the function V given by (6.6.9) is a Lyapunov function for the 
system (6.6.8), (6.6.12) since u, x in (6.6.8) can be expressed in terms of e\ 
and x rn where x m {t ) is treated as an arbitrary bounded function of time. 
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Reference Model 



Figure 6.16 Block diagram for implementing the indirect MRAC scheme 
given by (6.6.5), (6.6.6), and (6.6.12). 


Hence the equilibrium ei e = 0 ,a e = a,b e = b is u.s. and e\ ,b,a £ Coo. 
Using the usual arguments, we have e\ £ £2 and ei £ Coo which imply that 
e\ (t) = x(t) — x m (t) — > 0 as t — > 00 and therefore that a(t),b(t) -> 0 as 
t — > 00 . 

As in the direct case it can be shown that if the reference input signal 
r(t) is sufficiently rich of order 2 then b , a and, therefore, k, l converge to 
zero exponentially fast. 

Implementation The proposed indirect MRAC scheme for (6.6.1) described 
by (6.6.5), (6.6.6), and (6.6.12) is implemented as shown in Figure 6.16. 

6.6.2 Indirect MRAC with Unnormalized Adaptive Laws 

As in the case of direct MRAC considered in Section 6.5, we are interested 
in extending the indirect MRAC scheme for the scalar plant of Section 6.6.1 
to a higher order plant. The basic features of the scheme of Section 6.6.1 is 
that the adaptive law is driven by the tracking error and a single Lyapunov 
function is used to design the adaptive law and establish signal boundedness. 
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In this section, we extend the results of Section 6.6.1 to plants with 
relative degree n* = 1. The same methodology is applicable to the case of 
n* > 2 at the expense of additional algebraic manipulations. We assign these 
more complex cases as problems for the ambitious reader in the problem 
section. 

Let us start by considering the same plant and control objective as in 
the direct MRAC scheme of Section 6.4.1 where the relative degree of the 
plant is assumed to be n* = 1. We propose the same control law 


io\ = Fuji PgUp, wi(0) = 0 

lu 2 = Fu 2 + gy p , W 2 (0) = 0 (6.6.13) 

Up = 9 t u) 


as in the direct MRAC case where 9(t) is calculated using the estimate of k p 
and the estimates of the coefficients of the plant polynomials Z p (s),R p (s), 
represented by the vector 9 p (t), at each time t. Our goal is to develop an 
adaptive law that generates the estimate 9 p (t) and specify the mapping from 
9 p (t) to 9(t) that allows us to calculate 9(t) at each time t. We start with 


the mapping that relates the unknown vectors 9* = \9\ ,9*2, 9%, Cq 
9* specified by the matching equations (6.3.12), (6.3.16), and (6.3.17 
Q(s ) = 1 due to n* = 1 ), i.e. , 


T 


and 

(with 


c o = 


^ T a(s) = 


6 * 2 T a(s) + $3 A(s) — 


km 

kp 

A (s) - Z p (s), 

-Rp(s) Ao (s)7?m(s) 

krt 


(6.6.14) 


To simplify (6.6.14) further, we express Z p (s), R p (s), A(s), Aq (s)R m (s) 


as 


Z p (s) 

= s n l +pja n - 2 {s ) 

R p {s) 

= s n + a n _is n_1 +pja n _ 2 (s 

Ms) 

= s n 1 + A T a n _ 2 (s) 

A 0 (s)R m (s) 

= s n + r n _i.s n_1 + v T a n - 2 (s 


where pi, p 2 CR n ]a n - 1 are the plant parameters, i.e., 9* = [k p , pj, a n - 1 , pj] T ; 
A, v £ and r n _i are the coefficients of the known polynomials A(s), 
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Ao (s)Rm(s) and a n -2 (s) = [s n 2 , s n 3 , . . . , s, l] T , which we then substitute 
in (6.6.14) to obtain the equations 


c o 

6 { 

0*2 

0*3 



X- Pi 

p 2 - a n -i\ + r n -iX - v 



(6.6.15) 


If we let kp(t),pi(t),p2(t),a n -i(t) be the estimate of k p ,p\,p2, a n -i re- 
spectively at each time t, then 9(t) = [0^, 6j , 63, co] T may be calculated 
as 


co(t) 

0i(t) 

02(t) 

o 3 (t) 


km 

k p (t) 

X - Pi(t) 

P2{t) - a n -i(t)X + r n - iA - v 
kp(t) 

O-n—l it') fn—1 

k p (t) 


(6.6.16) 


provided \k p (t)\ 7^ 0, \/t > 0. 

The adaptive laws for generating pi,p2,a n -i,k p on-line can be devel- 
oped by using the techniques of Chapter 4. In this section we concentrate 
on adaptive laws that are driven by the tracking error e\ rather than the 
normalized estimation error, and are developed using the SPR-Lyapunov 
design approach. We start with the parametric model given by equation 
(6.4.6), i.e., 

ei = W m {s)p*(u p -6* T u) (6.6.17) 

where p* = = r 2 -. As in the direct case, we choose W m (s) . the transfer 

' Cq Km v 7 

function of the reference model, to be SPR with relative degree n* = 1. 
The adaptive law for 0 p = [k p ,pj ,a n - i,pJ] T is developed by first relating 
ei with the parameter error 0 P = 9 P — 9* through the SPR transfer function 
W m (s) and then proceeding with the Lyapunov design approach as follows: 
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We rewrite (6.6.17) as 


ei = W m (s )- — (kpUp — k p 0* T (v — k p u p + k p 9 T ui) (6.6.18) 

Km ' ' 


where —k p u p + k p 9 T ui = 0 because of (6.6.13). If we now substitute for 
k p 9* ,k p 0 from (6.6.15) and (6.6.16), respectively, in (6.6.18) we obtain 

ei= W m (s) kJ^X T uJi-UpjLpJ W 2 +On-i(yp-A T w 2 )- k p pj ui+kppj uq 

where k p = k p — k p ,p -2 = p 2 ~ P 2 ,o, n -i = a n - 1 — a n _ i are the parameter 
errors. Because — k p pj oj\ + k p pj uq + k p pj — kppj lo\ = —k p pj — k p pj u>\ 
we have 


d = W m (s) — k p i i + a„_i£ 2 + P 2 ^2 ~ k p p{ lo 1 

Km. L 


(6.6.19) 


where 


t A T -T t A \T 

U = A LUl- Up- p l Ul, & = y P -X 0 J 2 , PI = PI- PI 


A minimal state-space representation of (6.6.19) is given by 


e 

ei 


A c e + B c 
Cje 


kp^i + a n _ 1^2 +P 2 U 2 - k p pjuj 1 


( 6 . 6 . 20 ) 


where Cj (si — A c ) 1 B C = W m (s) ^ . Defining the Lyapunov-like function 
„_e T P c e , kp a\_ x | pjT^ l pi {u , | 

v -^ + 2 > + ^r + 2 lkpl+ 2 

where P c = Pj > 0 satisfies the algebraic equations of the LKY Lemma, 
7i , 7p > 0 and T,; = T ? t > 0, i = 1, 2, it follows that by choosing the adaptive 
laws 


Q>n— 1 


Pi 

P2 

kp 


-7i g i6 

rieicui sgn (k p ) 

-r 2 ei u>2 (6.6.21) 

! -7 P ei£i if \k P \ > k ° 

or if \k p \ = ko and ei£i sgn (k p ) < 0 
0 otherwise 
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where k p (Q) sgn (k p ) > ko > 0 and ko is a known lower bound for \k p \, we 
have 

! — e T2 |-e — is c e T if \k p \ > 0 or if \k p \ = ko and 

ei£i sgn(fcp) < 0 

— e T2 |-e — v c e T ^fe + ei£i k p if \k p \ = ko and ei£i sgn(fc p ) > 0 

where the scalar v c > 0, matrix L c = Lj > 0 and vector q are defined in the 
LKY Lemma. 

Because for ei£i sgn (k p ) > 0 and \k p \ = ko we have ( k p — k p ) sgn (k p ) < 0 
and ei£i k p < 0, it follows that 


V<-v c 


e T L c e 


2 


which implies that ei, e, a n -i,pi,p 2 , k p e and e, e\ G £ 2 - As in Section 
6.4.1, e\ G £00 implies that y p ,uJi,uj 2 G £ 00 , which, together with 0 G £00 
(guaranteed by (6.6.16) and the boundedness of 9 pi J-), implies that u p G 

Kp 

£oo. Therefore, all signals in the closed-loop system are bounded. The 
convergence of ei to zero follows from e\ G £2 and ei G £00 guaranteed by 
(6.6.18). 

We summarize the stability properties of above scheme, whose main equa- 
tions are listed in Table 6.8, by the following theorem. 


Theorem 6.6.1 The indirect MRAC scheme shown in Table 6.8 guarantees 
that all signals are u.b., and the tracking error ei converges to zero as t — > 00 . 

We should note that the properties of Theorem 6.6.1 are established un- 
der the assumption that the calculation of 6{t) is performed instantaneously. 
This assumption is quite reasonable if we consider implementation with a 
fast computer. However, we can relax this assumption by using a hybrid 
adaptive law to update 9 P at discrete instants of time thus providing suffi- 
cient time for calculating 9{t). The details of such a hybrid scheme are left 
as an exercise for the reader. 

As in the case of the direct MRAC schemes with unnormalized adaptive 
laws, the complexity of the indirect MRAC without normalization increases 
with the relative degree n* of the plant. The details of such schemes for the 
case of n* = 2 and higher are given as exercises in the problem section. 
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Table 6.8 Indirect MRAC scheme with unnormalized adaptive 

law for n* = 1 


Plant 

Vp ~ k PR p (s) U P’ n — 1 

Reference 

model 

Urn = W m (s)r, W m (s ) = 

Control law 

6j\ = Fuji + gup 
il >2 = Fuj 2 + gy p 

Up = 6 t lu 

0= Oj,0j,e 3 ,c o ;cu = [ujJ ,ujJ ,y p ,r] T 

Adaptive law 

. f-7p e i6 if \k P \ > k 0 or 

k p = < if \k p \ = ko and ei^isgn(fc p ) < 0 

10 otherwise 

On-l = ~ 7l e lC2 

Pi = rieiwi sgn(fcp) 
i>2 = —T2e\(jJ2 

6l = Vp Pm 

6 = A T cui - Up - pJujv, & = Vp - A t (U2 

Calculation 
of 6(t ) 

Co(t) = k m /kp(t ) 

0i (i) = A -pi(t) 

02 (i) = (P 2 (i) - a n _i(t)A + r n _iA - u)/k p (t ) 

0 3 (t) = (a„_i(t) - r n -i)/k p (t) 

Design 

variables 

k 0 : lower bound for \k p \ > Aiq >0; A £ 7?" _1 : coeffi- 
cient vector of A(s) — s n_1 ; r n _i G IZ 1 : coefficient of 
s n_1 in Ao (s)R m (s); v € 1Z n ~ l : coefficient vector of 
Ao(s)-R m (s) — s 11 — r n _is” _1 ; A(s),Ao(s) as defined in 
Section 6.4.1 
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Another interesting class of indirect MRAC schemes with unnormalized 
adaptive laws is developed in [116, 117, 118] using a systematic recursive 
procedure, called backstepping. The procedure is based on a specific state 
space representation of the plant and leads to control and adaptive laws that 
are highly nonlinear. 


6.6.3 Indirect MRAC with Normalized Adaptive Law 

As in the direct MRAC case, the design of indirect MRAC with normalized 
adaptive laws is conceptually simple. The simplicity arises from the fact that 
the control and adaptive laws are designed independently and are combined 
using the certainty equivalence approach. As mentioned earlier, in indirect 
MRAC the adaptive law is designed to provide on-line estimates of the high 
frequency gain k p and of the coefficients of the plant polynomials Z p (s), R P {s) 
by processing the plant input and output measurements. These estimates 
are used to compute the controller parameters at each time t by using the 
relationships defined by the matching equations (6.3.12), (6.3.16), (6.3.17). 

The adaptive law is developed by first expressing the plant in the form 
of a linear parametric model as shown in Chapters 2, 4, and 5. Starting with 
the plant equation (6.3.2) that we express in the form 

_ b m s m + b m -is m 1 + • • • + bo 
Vp ~ s n + a n -is n ~ l + ■ ■ ■ + a G Up 

where b m = k p is the high frequency gain, and using the results of Sec- 
tion 2.4.1 we obtain the following plant parametric model: 

z = 9* T (j) (6.6.22) 


where 



7i— m— 1 


and Ap(s) = s n + Aj a n _ i (s) with \ p = [A n _i, . . . , Ao] T is a Hurwitz polyno- 
mial. Since in this case m is known, the first n — m — 1 elements of 9* are 
known to be equal to zero. 
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The parametric model (6.6.22) may be used to generate a wide class of 
adaptive laws by using Tables 4.2 to 4.5 from Chapter 4. Using the estimate 
9 p{t) of 9*, the MRAC law may be formed as follows: 

The controller parameter vectors 9\{t), #2(0> 9 3 (t), co(t) in the control 

law 

— n( A — rv( A 

(6.6.23) 


-r cr(s) T q(s) t 

Un = &1 M/) Up + A [s) Vp + °^ Vp + c ° r = 0 u 


where c o = 


•v 

q t Q)„. a ' 00, 


A (s) U P ’ A(s) yp’VpH 


T 


a{s) = a n _ 2 (s) and 9= [9j , 9j , 9 3 , c 0 ] T is 
calculated using the mapping 9(t) = f(9 p (t)). The mapping /(•) is obtained 
by using the matching equations (6.3.12), (6.3.16), (6.3.17), i.e. , 

. km. 


c o = 


k r , 


9fa(s) + 9* 3 A(s) = 


9* a(s) = A(s) - Z p (s)Q(s) 

Q(s)R p {s ) - A 0 {s)R m {s) 


(6.6.24) 


kr, 


where Q(s) is the quotient of and A (s) = Ao (s)Z m (s). That is, if 

Rp(s,t), Z p (s,t) are the estimated values of the polynomials R p {s), Z p {s ) = 
kpZ p {s ) respectively at each time t, then co,#i,# 2,#3 are obtained as solu- 
tions to the following polynomial equations: 

km. 


Co = 


kn 


9ja(s ) = A (s) - -!-Zp(s,t) ■ Q{s,t) 

k p 

d]a(s) + 9 3 A{s) = ^~[Q(s,t) ■ Rp(s,t) - A 0 (s)i? m (s)] 

kp 


(6.6.25) 


provided k p / 0, where Q(s,t) is the quotient of A< • Here A(s,t) 


A 0 (s)R m (s) 

Rp{s,t) 

B(s,t ) denotes the frozen time product of two operators A(s,t), B(s,t). 
The polynomials R p (s,t), Z p (s,t) are evaluated from the estimate 

9 P — [0 , . . . , 0 , b m , ... ,b o , On— !)•••! a 0 ] 


n—m—1 


of 9* i.e., 


Rp(s, t) — s n + a n _is n + • • • + ao 
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Z p (s , t ) — b m s m + b m -is' n 1 + • • • + 6q 



As in Section 6.6.2, the estimate b m = k p should be constrained from going 
through zero by using projection. 

The equations of the indirect MRAC scheme are described by (6.6.23), 
(6.6.25) where 0 is generated by any adaptive law from Tables 4.2 to 4.5 based 
on the parametric model (6.6.22). Table 6.9 summarizes the main equations 
of an indirect MRAC scheme with the gradient algorithm as the adaptive 
law. Its stability properties are summarized by the following theorem: 

Theorem 6.6.2 The indirect MRAC scheme summarized in Table 6.9 guar- 
antees that all signals are u.b., and the tracking error e\ = y p — y m converges 
to zero as t — ► oo. 

The proof of Theorem 6.6.2 is more complex than that for a direct MRAC 
scheme due to the nonlinear transformation 9 P i— ► 9. The details of the 
proof are given in Section 6.8. The number of filters required to generate 
the signals u, (j) in Table 6.9 may be reduced from 4n — 2 to 2 n by selecting 
A p (s) = (s + Ao)A(s) for some Ao > 0 and sharing common signals in the 
control and adaptive law. 

Remark 6.6.1 Instead of the gradient algorithm, a least-squares or a hy- 
brid adaptive law may be used in Table 6.9. The hybrid adaptive law 
will simplify the computations considerably since the controller param- 
eter vector 9 will be calculated only at discrete points of time rather 
than continuously. 

The indirect MRAC scheme has certain advantages over the correspond- 
ing direct scheme. First, the order of the adaptive law in the indirect case 
is n + m + 1 compared to 2 n in the direct case. Second, the indirect scheme 
allows us to utilize any apriori information about the plant parameters to 
initialize the parameter estimates or even reduce the order of the adaptive 
law further as indicated by the following example. 

Example 6.6.1 Consider the third order plant 

1 

s 2 (s + a) Up 


Vp = 


(6.6.26) 
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Table 6.9 Indirect MRAC scheme with normalized adaptive law 


Plant 

Up — Gp(s)u p , G p (s) — Zp(s)/Rp(s), Z p (s) — kpZp^s ) 

Reference 

model 

n — k Zm O r 

y,n - ^m Rrn ^ s y 

Control 

law 

Up = 0 1 uj 

U) = uJ,u}J,y p ,r , 0= 8j,dj,6 3 ,c 0 

CXn —o(s) (Tri—ofsl 

^1= A(») “P >W2= Ms) Vp 

Adaptive 

law 

p 1 = Ti4> 1 e, p 2 = r 2 </> 2 e 
. f 7mC(/>i m if \k p \ > ko or 

k p = < if \k p \ = ko and 4>i m esgn(k p ) > 0 

[ 0 otherwise 

k p (0)sgn(kp) > k 0 > 0 
e = (z - z)/m 2 ,z = y p + Aj</> 2 
z = 8p(j), m 2 = 1 + </> T </>, f = [<j>J,<$] T 

6 p = [0 ,.„,0;k p ,pj ,pJ] T 

n—m—1 

1 _ atn-l (s) j _ a n-l(s) 

vl — Ap(s) P2 — Ap(s) y P 

<t>i = [<K,<t>T} T , 00 en m 

<t>im £ R 1 is the last element of <?!>o 

Pi = [bm-l, ■ ■ ■ , bo] T ,P 2 = [fin-1, •••■■> ®o] T 

Z p (s, t ) = kps m +pj a m -i(s), Rp(s, t ) = a„-i(s) 

Calculation 
of 6 

^ - Km 

0j ( t)a n - 2 (s ) = A(s) ^ (t) Z p (s, t) • Q(s, t) 

ej{t)a n - 2 {s) + 6 3 (t)A(s) 

= j ~(Q(s,t) ■ Rp(s,t) - A 0 {s)R m (s)) 

Kp 

Q(s,t ) = quotient of Ao (s)R m (s)/R p (s,t) 

Design 

variables 

ko- lower bound for \k p \ > ko > 0; A p (s): rnonic Hurwitz 
of degree n; For simplicity, A p (s) = (s + Ao)A(s), Ao > 0; 
A(s)=A 0 (s)Z m (s); r 1= rj >0, T 2 = T^ >0; Ti 
r 2 £ 7Z nxn ; X p G lZ n is the coefficient vector of A p (s) — s n 
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where a is the only unknown parameter. The output y p is required to track the 
output of y m of the reference model 


Urn — 


(s + 2)3 r 


The control law is given by 

where 8 = [ 8 \\, $ 12 , # 21 , 822 , 83, co] T € TZ 6 . In direct MRAC, 8 is generated by a 
sixth-order adaptive law. In indirect MRAC, 8 is calculated from the adaptive law 
as follows: 

Using Table 6.9, we have 


8 P = [0, 0,1, a, 0, Of 


a = 7a</> a e 


e = , ,t , , z = 0l<t>, z = y p + \I<j )2 


1 + . 




0 = [01 , 02 ] . 01 = ( s + Ai) 3 ^ 


Up ; 02 — 


[s 2 ,s, 1] T 

(a + Ai)3 

where A p (s) is chosen as A p (s) = (s + A 1 ) 3 , \ p = [3Ai, 3A 2 , Af] T . 

„2 


y P 


<t>a = [ 0 , 0 , 0 , 1 , 0 , 0 , ]</>=-- 


\Vp 


(s + A,)3- 

and 7 a > 0 is a constant. The controller parameter vector is calculated as 


c 0 = 1, 8{ 


— (s + A 1) 2 — Q(s, t) 


9 t 

y 2 


+ 8s(s + A 1) 2 — Q(s , f) • [s 8 * + as 2 ] — (s + Ai) 2 (s + 2)^ 


where Q(s,t ) is the quotient of 


V 


The example demonstrates that for the plant (6.6.26), the indirect scheme 
requires a first order adaptive law whereas the direct scheme requires a sixth- 
order one. 
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6.7 Relaxation of Assumptions in MRAC 

The stability properties of the MRAC schemes of the previous sections are 
based on assumptions PI to P4, given in Section 6.3.1. While these as- 
sumptions are shown to be sufficient for the MRAC schemes to meet the 
control objective, it has often been argued whether they are also necessary. 
We have already shown in the previous sections that assumption P4 can be 
completely relaxed at the expense of a more complex adaptive law, therefore 
P4 is no longer necessary for meeting the MRC objective. In this section we 
summarize some of the attempts to relax assumptions PI to P3 during the 
1980s and early 1990s. 

6.7.1 Assumption PI: Minimum Phase 

This assumption is a consequence of the control objective in the known 
parameter case that requires the closed-loop plant transfer function to be 
equal to that of the reference model. Since this objective can only be achieved 
by cancelling the zeros of the plant and replacing them by those of the 
reference model, Z p (s) has to be Hurwitz, otherwise zero-pole cancellations 
in C + will take place and lead to some unbounded state variables within the 
closed- loop plant. The assumption of minimum phase in MRAC has often 
been considered as one of the limitations of adaptive control in general, rather 
than a consequence of the MRC objective, and caused some confusion to 
researchers outside the adaptive control community. One of the reasons for 
such confusion is that the closed-loop MRAC scheme is a nonlinear dynamic 
system and zero-pole cancellations no longer make much sense. In this case, 
the minimum phase assumption manifests itself as a condition for proving 
that the plant input is bounded by using the boundedness of other signals 
in the closed-loop. 

For the MRC objective and the structures of the MRAC schemes pre- 
sented in the previous sections, the minimum phase assumption seems to be 
not only sufficient, but also necessary for stability. If, however, we modify 
the MRC objective not to include cancellations of unstable plant zeros, then 
it seems reasonable to expect to be able to relax assumption PI. For exam- 
ple, if we can restrict ourselves to changing only the poles of the plant and 
tracking a restricted class of signals whose internal model is known, then we 
may be able to allow plants with unstable zeros. The details of such designs 
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that fall in the category of general pole placement are given in Chapter 7. 
It has often been argued that if we assume that the unstable zeros of the 
plant are known, we can include them to be part of the zeros of the reference 
model and design the MRC or MRAC scheme in a way that allows only the 
cancellation of the stable zeros of the plant. Although such a design seems 
to be straightforward, the analysis of the resulting MRAC scheme requires 
the incorporation of an adaptive law with projection. The projection in turn 
requires the knowledge of a convex set in the parameter space where the es- 
timation is to be constrained. The development of such a convex set in the 
higher order case is quite awkward, if possible. The details of the design and 
analysis of MRAC for plants with known unstable zeros for discrete-time 
plants are given in [88, 194], 

The minimum phase assumption is one of the main drawbacks of MRAC 
for the simple reason that the corresponding discrete-time plant of a sampled 
minimum phase continuous-time plant is often nonminimum phase [14]. 

6.7.2 Assumption P2: Upper Bound for the Plant Order 

The knowledge of an upper bound n for the plant order is used to determine 
the order of the MRC law. This assumption can be completely relaxed if the 
MRC objective is modified. For example, it has been shown in [102, 159, 160] 
that the control objective of regulating the output of a plant of unknown 
order to zero can be achieved by using simple adaptive controllers that are 
based on high-gain feedback, provided the plant is minimum phase and the 
plant relative degree n* is known. The principle behind some of these high- 
gain stabilizing controllers can be explained for a minimum phase plant with 
n* = 1 and arbitrary order as follows: Consider the following minimum phase 
plant with relative degree n* = 1: 

kpZp(s) 

yp = -iw Up 

From root locus arguments, it is clear that the input 

Up = - dy p sgn(k p ) 


with sufficiently large gain 9 will force the closed loop characteristic equation 

Rp(s) + \kp\9Zp(s) = 0 



414 


CHAPTER 6. MODEL REFERENCE ADAPTIVE CONTROL 


to have roots in Re [s] < 0. Having this result in mind, it can be shown that 
the adaptive control law 

u p = - 0y p sgn ( k p ) , 0 = y 2 p 

can stabilize any minimum phase plant with n* = 1 and of arbitrary or- 
der. As n* increases, the structure and analysis of the high gain adaptive 
controllers becomes more complicated [159, 160]. 

Another class of adaptive controllers for regulation that attracted con- 
siderable interest in the research community is based on search methods and 
discontinuous adjustments of the controller gains [58, 137, 147, 148]. Of par- 
ticular theoretical interest is the controller proposed in [137] referred to as 
universal controller that is based on the rather weak assumption that only 
the order n c of a stabilizing linear controller needs to be known for the sta- 
bilization and regulation of the output of the unknown plant to zero. The 
universal controller is based on an automated dense search throughout the 
set of all possible n c -order linear controllers until it passes through a subset 
of stabilizing controllers in a way that ensures asymptotic regulation and the 
termination of the search. One of the drawbacks of the universal controller is 
the possible presence of large overshoots as pointed out in [58] which limits 
its practicality. 

An interesting MRAC scheme that is also based on high gain feedback 
and discontinuous adjustment of controller gains is given in [148]. In this 
case, the MRC objective is modified to allow possible nonzero tracking errors 
that can be forced to be less than a prespecified (arbitrarily small) constant 
after an (arbitrarily short) prespecified period of time, with an (arbitrarily 
small) prespecified upper bound on the amount of overshoot. The only 
assumption made about the unknown plant is that it is minimum phase. 

The adaptive controllers of [58, 137, 147, 148] where the controller gains 
are switched from one constant value to another over intervals of times based 
on some cost criteria and search methods are referred to in [163] as non- 
identifier-based adaptive controllers to distinguish them from the class of 
identifier-based ones that are studied in this book. 

6.7.3 Assumption P3: Known Relative Degree n* 

The knowledge of the relative degree n* of the plant is used in the MRAC 
schemes of the previous sections in order to develop control laws that are free 
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of differentiators. This assumption may be relaxed at the expense of addi- 
tional complexity in the control and adaptive laws. For the identifier-based 
schemes, several approaches have been proposed that require the knowledge 
of an upper bound n* for n* [157, 163, 217]. In the approach of [163], n* 
parameterized controllers C % , i = 1,2, ... ,n* are constructed in a way that 
Ci can meet the MRC objective for a reference model Mi of relative degree i 
when the unknown plant has a relative degree i. A switching logic with hys- 
teresis is then designed that switches from one controller to another based 
on some error criteria. It is established in [165] that switching stops in finite 
time and the MRC objective is met exactly. 

In another approach given in [217], the knowledge of an upper bound n* 
and lower bound of n* are used to construct a feedforward dynamic term 
that replaces c Q r in the standard MRC law, i.e. , u p is chosen as 

Up = 9j + @2 + ^3Vp + h{s)ni(s)W m (s)r 

where b\(s ) = l,s, ...,s n * , n* = n* — n* , 9% G IZ n * +1 and roi(s) is an 

arbitrary rnonic Hurwitz polynomial of degree n*. The relative degree of 
the transfer function W m (s) of the reference model is chosen to be equal to 
n* . It can be shown that for some constant vectors the MRC 

objective is achieved exactly, provided 9% is chosen so that kpO^ b\{s) is a 
rnonic Hurwitz polynomial of degree n* — n J < n* , which implies that the 
last n* — n* — elements of 9* are equal to zero. 

The on-line estimate 9i of 9* , i = 1 , . . . , 4 is generated by an adaptive law 
designed by following the procedure of the previous sections. The adaptive 
law for $ 4 , however, is modified using projection so that #4 is constrained 
to be inside a convex set C which guarantees that k p 9jbi(s) is a rnonic 
Hurwitz polynomial at each time t. The development of such set is trivial 
when the uncertainty in the relative degree, i.e., n* is less or equal to 2. For 
uncertainties greater than 2 , the calculation of C, however, is quite involved. 

6.7.4 Tunability 

The concept of tunability introduced by Morse in [161] is a convenient tool for 
analyzing both identifier and nonidentifier-based adaptive controllers and for 
discussing the various questions that arise in conjunction with assumptions 
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PI to P4 in MRAC. Most of the adaptive control schemes may be represented 
by the equations 

x = A(9)x + B (9)r 

ei = C{9)x (6.7.1) 

where 9 : 1Z + i— > 7 Z n is the estimated parameter vector and e± is the estima- 
tion or tuning error. 

Definition 6.7.1 [161, 163] The system (6.7.1) is said to be tunable on a 
subset S C lZ n if for each 6 6 S and each bounded input r, every possible 
trajectory of the system for which e\ (f) = 0, t G [0, oo) is bounded. 

Lemma 6.7.1 The system (6.7.1) is tunable on S if and only if {C (9) ,A(9)} 
is detectable for each 9 € S. 

If {C(9\ A(#)} is not detectable, then it follows that the state x may grow un- 
bounded even when adaptation is successful in driving e\ to zero by adjusting 
6. One scheme that may exhibit such a behavior is a MRAC of the type 
considered in previous sections that is designed for a nonminimum-phase 
plant. In this case, it can be established that the corresponding system of 
equations is not detectable and therefore not tunable. 

The concept of tunability may be used to analyze the stability properties 
of MRAC schemes by following a different approach than those we discussed 
in the previous sections. The details of this approach are given in [161, 163] 
where it is used to analyze a wide class of adaptive control algorithms. The 
analysis is based on deriving (6.7.1) and establishing that {C(9),A(9)} is 
detectable, which implies tunability. Detectability guarantees the existence 
of a matrix H(9) such that for each fixed 6 E S the matrix A c {9) = A(9 ) — 
H{9)C(9) is stable. Therefore, (6.7.1) may be written as 

x = [A {9) - H(0)C(O)] x + B{9)r + H{9)e 1 (6.7.2) 

by using the so called output injection. Now from the properties of the adap- 
tive law that guarantees 9, [A € £2 where m = 1 + (Cq x) 2 for some vector 
Co, we can establish that the homogeneous part of (6.7.2) is u.a.s., which, 
together with the B-G Lemma, guarantees that x £ C^. The boundedness 
of x can then be used in a similar manner as in the previous sections to es- 
tablish the boundedness of all signals in the closed loop and the convergence 
of the tracking error to zero. 
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6.8 Stability Proofs of MRAC Schemes 

6.8.1 Normalizing Properties of Signal m,f 

In Section 6.5, we have defined the fictitious normalizing signal m/ 

TO / = 1 + II w pI| 2 + ll^pll 2 (6.8.1) 

where || • | denotes the £ 25 -norm, and used its normalizing properties to establish 
stability of the closed-loop MRAC for the adaptive tracking and regulation exam- 
ples. We now extend the results to the general SISO MRAC scheme as follows: 

Lemma 6.8.1 Consider the plant equation (6.3.2) and control law (6.5.39). There 
exists a 6 > 0 such that: 

(i) u>i G Coo- 

(ii) If 6 G Coo, then u p /mf,y p /mf,oj/m.f,W(s)Lij/mf G £oo where W(s) is a 
proper transfer function with stable poles. 

(iii) Ifr,6 G Coo > then in addition to (i) and (ii), we have ||y p ||/m/, ||w||/m/ G Coo- 

Proof (i) Because 

a(s) a(s) 

Ul = ~Afs) Up ' U2 = A (s) yp 

and each element of has relative degree greater or equal to 1, it follows from 
Lemma 3.3.2 and the definition of nif that Wi/m/, o^/m,/ G Coo- 
(ii) We can apply Lemma 3.3.2 to equation (6.5.42), i.e., 

y P = W m (s) (r + \o t lu 

\ c o 

to obtain (using 9 G £oo) that 

\y p (t)\ < c + c\\9 t oj\\ < c+ c||w|| 

where c > 0 denotes any finite constant and || • || the C 28 norm. On the other hand, 
we have 


IMI < IMI + IMI + IMI + IMI < c||u p || + c||y p || + c < cm/ + c 

therefore, G Coo ■ Because u> — [u>{ ,y p ,r\ T and oq/m/, uq /mp, y p /rrif, 
r/nrtf G Coo, it follows that co/m.f G Coo- From u p = 9 T u> and u>/mf,0 G Coo we 
conclude that u p /mf G Coo- Consider </> = W(s)u. We have \(f>\ < c||w|| for some 
6 > 0 such that W(s — |) is stable. Hence, G Coo due to ||w||/m/ G Coo- 
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(iii) Note that y p = sW m (s)[r + ^9 T to] and sW m (s) is a proper stable transfer 
function, therefore from Lemma 3.3.2 we have 

|| jfp|| < c + cll^H < c + cnif 


i.e., \\y P \\/mf G £oo. Similarly, because ||th|| < ||wi|| + ||w 2 || + ||y p || + ||f||, applying 
Lemma 3.3.2 we have ||wi|| < c||u p ||, ||w 2 || < c||j/ p || which together with G 

imply that ||w||/to/ G and the lemma is proved. In (i) to (iii), <5 > 0 is chosen 
so that W m (s), W(s) are analytic in Re[s] > 5/2. □ 


6.8.2 Proof of Theorem 6.5.1: Direct MRAC 

In this section we present all the details of the proof in the five steps outlined in 
Section 6.5.3. 


Step 1. Express the plant input and output in terms of the adaptation error 
9 t uj. From Figure 6.14, we can verify that the transfer function between the input 
r + -\9 t oj and the plant output y„ is given by 

c o 

y p = G c (s ) (r+ \o T to 

\ c o 

where 

c*k p Z p 

Gc{s) = (1 - 9f(sl - F)~ 1 g)R p - k p Z p {9* 2 T (sI - F)~'g + 9* 3 } 

Using (6.3.12), (6.3.13), and (s/— F) _1 ^=^ y, we have, after cancellation of 
all the stable common zeros and poles in G c (s), that G c (s) = W m (s). Therefore, 
the plant output may be written as 


y P = W m (s) (r + (6.8.2) 

Because y p = G p (s)u p and G“ x (s) has stable poles, we have 

u p = G~ 1 (s)W m (s ) + ^9 r wj (6.8.3) 

where G~ 1 (s)W m (s) is stable (due to assumption PI) and biproper (due to assump- 
tion M2). For simplicity, let us denote the £ 2 , 5 -norm IKOtlk for some 5 > 0 by 

|| • || . Using the fictitious normalizing signal mfj = 1 + ||w p || 2 + ||2/ P || 2 , it follows from 
(6.8.2), (6.8.3) and Lemma 3.3.2 that 

to 2 < c + c||0 T w|| 2 


(6.8.4) 
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holds for some S > 0 such that W m (s—S/2)G~ 1 (s—S/2) is a stable transfer function, 
where c > 0 in (6.8.4) and in the remainder of this section denotes any finite 
constant. The normalizing properties of mj have been established in Section 6.8.1, 
i.e., all signals in the closed-loop adaptive system and some of their derivatives are 
bounded from above by to/ provided 9 £ C^,. 

Step 2. Use the swapping lemmas and properties of the C 28 norm to bound 
\\9 T u\\ from above with terms that are guaranteed by the adaptive law to have finite 
C 2 gains. Using the Swapping Lemma A. 2 in Appendix A, we can express 9 T u as 

9 T u = F\ (s, «o) w + 9 T u^j + F(s , ao) (6.8.5) 

where F(s,a 0 ) = 1 Ji(g, ao) = 1 ~- F U ct °) ; ao > 0 is an arbitrary con- 

stant and n* is the relative degree of W rn (,s) . On the other hand, using Swapping 
Lemma A.l, we can write 

9 t lo = W _1 (s) (e T W(s)uj + lU c (s)((fU b (sV T )0)) (6.8.6) 

where W(s) is a strictly proper transfer function with poles and zeros in C~ that 
we will specify later. Using (6.8.6) in (6.8.5) we obtain 

;.T _ t, 

9 t uj = F\[9 oj + 9 t u\ + FW~ 1 [6 t W(s)u> + W c ((W b u> T )6)\ (6.8.7) 

where -F(s)W _1 (s) can be made proper by choosing W(s) appropriately. 

We obtain a bound for ||f? T w|| by considering each adaptive law separately. 

Adaptive Law of Table 6.5. We express the normalized estimation error as 
e = W m L(p*9 T (j) -p£- en 2 ) 

i.e., 

d T cf = ~9 T L~ 1 io = ^(W-'L-'e + p( + en 2 s ) (6.8.8) 

Choosing W{s) = L _1 (s) and substituting for 9 T W{s)u = 9 t L~ 1 (s)lu from (6.8.8) 
into (6.8.7), we obtain 

9 t lo = FU^ uj + 9 t uj) + —FWz}e + FL ^ + W c (W b u T )'() 

p* IP* P* 

Now from the definition of £ and using Swapping Lemma A.l with W(s) = 
L -1 (s), we have £ = Uf — 9 T <j> = L~ 1 u p — 9 T (f, i.e., 

£ = -9 T (j) + L~ 1 9 t lo = W c [{W b u> r )9] 
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Therefore, 

•JT 1 1 

9 t co = Fi[6 u + e T u\ + — FW~ x e + — FL[pW c {W b uj T )9 + en 2 s ] (6.8.9) 

From the definition of F(s), F\(s) and Swapping Lemma A. 2, it follows that for 
ao > 5 

r-it 


|-Fi(s)||oo5 < — , II F(s)W m (s)llootf < caS 

OL 0 


Therefore, 


ll^ll < -(II 0 a>|| + ||0 T ch||) + ca£ (||e|| + M| + \\en 2 s 
«o 


( 6 . 8 . 10 ) 


where w = pW c (s)[Qb(t)9],Qb = Wb(s)u T . Using Lemma 3.3.2, we can show that 
Qb/rrif £ Coo and, therefore, 

\\u>\\ < c\\em f \\ (6.8.11) 

Using the normalizing properties of ?n/ established in Lemma 6.8.1, we have 

||0 w|| < c||0m/||, ||0 T u;|| < c||w|| < cm f , ||en 2 || < c\\en s m f \\ 

It, therefore, follows from (6.8.10), (6.8.11) that 

ll^ll < z-\\b™,f\\ + m./ + cckq * ( || e|| + || 0m/ 1| + ||en s m/||) (6.8.12) 


ao 


which we express as 


a 0 


||0 T u;|| < ca£ \\gm f \\ + —m f 
a 0 


(6.8.13) 


where g 2 = e 2 n 2 + e 2 + |0| 2 , by taking ao > 1 so that 4- < ag and using ||e|| < 
||em/|| due to mf > 1. Since en s ,e,9 £ C 2 , it follows that g £ C 2 . 

Adaptive Laws of Table 6.6. The normalized estimation error is given by 


e = 


W m (s)u p - z _ 9 T (, 


p , n 2 = ^ T ' 


l + «§ - l + n 2> ”e-Y p rp 

where </> p = [W m (s)u>J , W m (s)uJ , W m (s)y pi y p } T . Let 

LOp = [ U) I ,U>2 >2/pi Up ] 

then (f> p = W m {s)u> p . To relate 6 t uj with e, we write 

0 T u> = 0 qU>o + Cgr 
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where u>q = [tuj , ujJ , y p ] T and 9 0 = [6j , 9j , 9^ . From 

y P = W m (r + \ 9 T tu) 
c o 

we have r = — ^9 T u, therefore, 


9 t w = 9qU> 0 + coW^lyp) - ^0 T u; = 9 r u p - °^9 t uj 


i.e., 


5T, . _ H)pT, 
Co 


Using (f> p = W m (s)co p and applying Swapping Lemma A.l we have 


(6.8.14) 


W m (s)9 T u p = 9 T (t> p + W c (W b uJ)9 

From Swapping Lemma A. 2, we have 

jlT 

9 t u p = F 1 (9 ujp + 9 T Up) + F(6 t lu p ) 

where F 1 (s),F(s) satisfy || -Fi (s) || oo5 1| < ||F(s)M / “ 1 (s)|| 00 5 < caf for a 0 > 6. 

Using the above two inequalities, we obtain 

9 t u p = F^e CJ P + 9 t lo p ) + FW-^tp + W c (W b cjJ)0] (6.8.15) 

Using (6.8.14) we obtain 


a T I flT • __ Co 2T I Co 3j T Co 5 T • 

9 ujp + 9 uip — —9 u! H — -9 u> H — -9 lo 
c 0 c 0 c 0 

which we use together with 9 T c/) p = — e — enj to express (6.8.15) as 
9 T u p = Fi (^9 t lj+ ^'e T u+ C ^~e T <^\+FW- 1 [-e-enl + W c {W b ^)e] (6.8.16) 

V c 0 c 0 C 0 / 

Due to the boundedness of A, it follows from (6.8.14) that 

||0 T w|| < c||0 T o; p || (6.8.17) 

Using the same arguments as we used in establishing (6.8.13) for the adaptive laws 
of Table 6.5, we use (6.8.16), (6.8.17) and the normalizing properties of m/ to obtain 

||0 T w|| < c\\9 T u p \\ < caf\\gmf\\ + —m f 


(6.8.18) 
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where g G C 2 . 

Step 3. Use the B-G Lemma to establish signal boundedness. Using (6.8.13) or 
(6.8.18) in m 2 < c + c\\9 T u>\\ 2 , we have 

m 2 f<c + ca q h \\gmf\\ 2 + A^m 2 (6.8.19) 

chq 

which for large «o implies 


m 2 <c + cotQ n \\gm f \\' 2 

for some other constant c > 0. The boundedness of m/ follows from g G £2 and 
the B-G Lemma. 

Because ra/,0 G £oo, it follows from Lemma 6.8.1 that u p ,y p ,u> € £00 and 
therefore all signals in the closed-loop plant are bounded. 

Step 4. Establish convergence of the tracking error to zero. Let us now consider 
the equation for the tracking error that relates e\ with signals that are guaranteed 
by the adaptive law to be in £ 2 - For the adaptive law of Table 6.5(A), we have 

ei = e + W m Len 2 + W m Lpf 

Because £ = L~ 1 9 T to — 6 T (j) = W c {W b u T )9, e,en s ,9 G £2 and u>,n s G £ 00 , it 
follows that ei G £2- In addition we can establish, using (6.8.2), that e \ G £00 and 
therefore from Lemma 3.2.5, we conclude that e\(t) — > 0 as t — » 00 . 

For the adaptive laws of Table 6.5(B,C) the proof follows by using L(s ) = W~\s) 
and following exactly the same arguments as above. For the adaptive laws of Ta- 
ble 6.6 we have 

era 2 = W m 6 T LO — 6 1 (j> p 
Applying Swapping Lemma A.l, we have 

W m 0 T u> = e T W m u + W c (W b u r )6 

i.e., 

era 2 = e T [W m u> - (j) p \ + W c (W b u T )d 
Now, W m u> — (j) p = [0, . . . , 0, — ei] T , hence, 

coei = —era 2 + W c {WbU T )Q 

Because are bounded and cm, 9 € £ 2 , it follows that ei € £ 2 . As before we 

can use the boundedness of the signals to show that e.\ G which together with 
ei G £2 imply that ei(£) — > 0 as t — > 00 . 
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Step 5. Establish parameter convergence. First, we show that </>, <j) p are PE if 
r is sufficiently rich of order 2 n. From the definition of <j>, we can write 


4>=H(s) 


(si - F)~ 1 gu p 
(si - F)~ 1 gy p 

Vp 

r 


where H(s ) = W m (s) if the adaptive law of Table 6.5(B) is used and H(s) = L~ 1 (s ) 
if that of Table 6.5(A) is used. Using u p = Gi p 1 (s)y p and y p = W m (s)r + e\, we 
have 


where 



(si - F) 1 gG 1 (s)W m (s) 
(si - F)- f gW m (s) 
W m (s ) 


1 

"“T 

5cT 

O 

^ 1 

4>m = H(s) 

r, <t> = H(s) 

1 

l-S 

^0 


1 


L 0 j 


ei 


Because ei £ C 2 , it follows from the properties of the PE signals that <j> is PE if 
and only if <j> m is PE. In the proof of Theorem 6.4.1 and 6.4.2, we have proved that 

(si - F)- 1 gG- 1 (s)W m (s) 

A | (si - F)- { gW m (s) 

W m (s) 

1 

is PE provided r is sufficiently rich of order 2 n. Because H(s) is stable and minimum 
phase and (fio £ Coo owing to r £ Coo, it follows from Lemma 4.8.3 (iv) that 
<t>m = H(s)cj)Q is PE. 

From the definition of (f> p , we have 


(ftp fU m (s) 


" (si — F)- 1 gGp 1 (s)W m (s ) " 


" W m (s)(sl — F)- 1 gGp 1 (s ) " 

(sI-F)- l gW m (s) 

r + 

W m (s)(sl - F)-^ 

W m (s ) 

W m (s) 

1 


1 


ei 


Because (f> p has the same form as (j), the PE property of <j> p follows by using the 
same arguments. 

We establish the convergence of the parameter error and tracking error to zero 
as follows: First, let us consider the adaptive laws of Table 6.5. For the adaptive 
law based on the SPR-Lyapunov approach (Table 6.5(A)), we have 

e = WmL(p*9 T (j) - p£ - en 2 s ) 

9 = — r e(f>sgn(k p / k m ) 


( 6 . 8 . 20 ) 
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where Cj (si — A C )~ 1 B C = W m (s)L(s) is SPR. The stability properties of (6.8.20) 
are established by Theorem 4.5.1 in Chapter 4. According to Theorem 4.5.1 (iii), 
9(t) — > 0 as t — > oo provided € £ oo, <j> is PE and £ € £ 2 - Because £ = 
W c (Wj,u; T )0 and w G £ 00 , 0 G £ 2 , we have £ G £ 2 . From u p ,y p G £oo and the 
expression for <fi, we can establish that (f>, <j) G £00 • Because 0 is shown to be PE, 
the convergence of 9(t) to zero follows. From 8(t) — > 0 as t — > 00 and e?r 2 G £2 
and the stability of A c , we have that e(t) — > 0 as t — » 00 . In fact, the convergence 
of e(t) to zero follows from the properties of the Lyapunov-like function used to 
analyze (6.8.20) in Theorem 4.5.1 without requiring <f> to be PE. 

The proof for the adaptive law of Table 6.5(B) follows from the above arguments 
by replacing L~ 1 (s ) with W m (s) and using the results of Theorem 4.5.2 (iii). 

For the adaptive laws of Table 6.6, we have established in Chapter 4 that without 
projection, 9(t) — > 0 as t — > 00 exponentially fast provided 4> p is PE. Since projection 
can only make the derivative V of the Lyapunov-like function V, used to analyze the 
stability properties of the adaptive law, more negative the exponential convergence 
of 9 to zero can be established by following exactly the same steps as in the case of 
no projection. 

6.8.3 Proof of Theorem 6.6.2: Indirect MRAC 

We follow the same steps as in proving stability for the direct MRAC scheme: 

Step 1. Express y p , u p in terms of9 T u. Because the control law for the indirect 
MRAC is the same as the direct MRAC scheme, equations (6.8.2), (6.8.3) still hold, 
i.e., we have 

Up = W m (s) (r + , u p = G~ 1 (s)W m (s) (r+ \ 8 t uj 

\ c o / V c o 

As in the direct case, we define the fictitious normalizing signal 

m) = 1 + ||u p || 2 + ||?/p|| 2 

where || • || denotes the £ 2 < 5 -norm for some 5 > 0. Using the same arguments as in 
the proof of Theorem 6.5.1, we have 

to 2 < c + c||Fu;|| 2 

Step 2. We upper bound ||0 T w|| with terms that are guaranteed by the adaptive 
laws to have £2 gains. From (6.6.25), we have 

9ja n - 2 (s) = A (s) - J-Z p (s,t) ■ Q(s,t) 
k v 


( 6 . 8 . 21 ) 
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9ja„- 2 (s) + 9 3 A(s ) = J-[Q(s,t) • R p (s,t ) - A 0 (s)i? m (s)] 
k p 


( 6 . 8 . 22 ) 


Consider the above polynomial equations as operator equations. We apply (6.8.21) 
to the signal W j^g^ u P , and ( 6 . 8 . 22 ) to y p to obtain 


Oj W m (s)uj i = W m (s)u p - Z p (s , t) ■ Q(s, t) 


W m {s) 
A(s) 1 


0jw m (s)u>2 + 9 3 W m (s)y p = ■ R p (s,t) - A 0 (s)R m (s)} 


W m {s) 

A (8) 


-y P 


Combining these two equations, we have 

0 o bC m (s)u^o = IPm(s)'Up Z p {s, t) • Q(s, t ) 


w m (s) 
A (s) ' 


+ Y~ [Q(s, t) ■ R p (s, t) - A 0 (s)i? m (s)] ^ y p (6.8.23) 

k p A 1 S J 

where 9q = [9j, 9j, ^ 3 ] T , oj 3 = [ujJ, ujJ, j/ p ] t . Repeating the same algebraic manip- 
ulation, but replacing 9 , 9 P by 9 * , 9* in the polynomial equations, we have 


9*'W m (s)cu 0 = W m (s)u p — 


Z p (s)Q(s) W m {s) 


k p A (s) 


+ 


Q(s)R p (s ) -A 0 (s)R m (s) W m (s ) 


k p A (s) 


V P (6-8.24) 


where Q(s) is the quotient of A 0 (s) R m (s) / R p (s) whose order is n* — 1 and n* is 
the relative degree of G p (s). Subtracting (6.8.24) from (6.8.23), we have 


9ZW m {s)u 0 = \-^Zp(s,t)-Q(s,t) Wm ^~ ' 1 


A(s) 


+ — Q(s,t) ■ R p (s,t) 


W m {s ) 

A(s) 


-y P 


1 Ag(s)R m (s) , . 1 Ao(s)i? m (s) / . 

W m (s)y p - r W m (s)y p 


A(a) 


k p A (s) 


Z P (s)Q(s ) Rp(s)Q(s) 

~ -jgr m{ ) v ' 'W l ls ' 


A 


= e f - e l / + e 2 / 


(6.8.25) 


where 6*0 = $o — and 

A 1 ^ A/ A^m(*) . 1 A/ ,\ £> / .'.bFm(s) 

6 / — Zp\ s i t) ■ Q[S, t) r Up + j— Q(s , t) ■ Rp\S , f) - t J/p 

k v A l s J k d A l s J 
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eif,e 2 f are defined as the terms in the second and third brackets of (6.8.25), re- 
spectively. Because co = A(s) = A 0 (s)Z m (s) and Z p (s)u p = R p (s)y p , 

we have 

ei / = (c 0 - cl)y pi e 2 f = 0 (6.8.26) 

Using (6.8.26) in (6.8.25) and defining u) p = W'“ 1 (s)j/ p ] T we can write 

e T W m {s)u p = e f (6.8.27) 

Because 9 T u> = proved in Section 6.8.2 (see equation (6.8.14) ), we use 

Swapping Lemma A. 2 to write 

9 t lo = (Fi(s, a 0 )(6 T Lo p + 6 t uj p ) + F(s, a 0 )9 T io p ) (6.8.28) 

Co v / 

where F(s,a o) and .Fi(s,a:o) are as defined in Section 6.8.2 and satisfy 

||^l(s,a 0 )||oo<5 < — , ||F(s,a 0 )fU~ 1 (s)|| OO(5 < c«o* 

a 0 

for any ag > 6 > 0. Applying Swapping Lemma A.l to W m (s)9 T u p and using 
(6.8.27), we obtain 

e T co p = w-\s)(Fw m {s)^ p + w c {s){w b {s)ul)e) 

= W- 1 (s){e f + W c ( S )(W b (s)coJ)9) (6.8.29) 

Substituting (6.8.29) in (6.8.28), we have 

9 t lu = N (^Fi(s, ao)(6 T uj p + 9 T iL p ) + F(s, «o )W m 1 (s) (ef + W c (s)(Wt,(s)u>J)6^ 

Because Co is bounded from below, i.e., |co| >c 0 > 0, it follows from Lemma 3.3.2 
that 

ll^ll < §r fll^ll + II^ID + ca^dleyll + Ml)) (6.8.30) 

«o V J 

where Co = Wb(to p ) T . From 9 £ C Q 0 and the normalizing properties of ?n/, we have 

||^ T w|| < — \\0m f \\ + — m f + caf (||e/|| + \\9m f \\) (6.8.31) 

<x o oca 

We now concentrate on the term ||e/|| in (6.8.31). Let us denote 

Q(s,t) — Q cx n * — l (^) 5 Zp ( s , t) — b p a m (s), 7?p(s, t) s -t- a p cx n — i (s) 
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where b p = [k p ,pJ] T , a p = p 2 and Pi,P 2 are defined in Table 6.9. Treating s as the 
differentiation operator, using A p (s) = A(s)(s + Ao) and Swapping Lemma A. 3 (i), 
we have 


1 


Z p (s,t) ■ Q(s,t) 


bbm( s ) 1 A, 

Wp = —Q{s,t) ■ Z p (s,t) 


Ms) " fe 


W / m (s)(s + Ao) 
Ap(s) 


Q(M) Z p (s,t) 


1 


-g T Z? n ._ 2 (s) 


W / m (s)(s + Aq) 


Ap(s) 

«n*-2 (s) ( 


(6.8.32) 


W / m (s)(s + Ao) 
Ap(s) 

_ Wm (S) 


(by taking A(s,t) = Q(s,t), B(s,t) = Z p (s,t),f = Mlsf u p) and 


J-Q(s,t) ■ Rp(s,t)~ irfy-yp 

fen A l S J 


= 7- <! Q(s,t) ( Rp(s,t) 


W / m(s)(s + Aq) 


Ap(s) 


~Vp 


(6.8.33) 


-q T D n ._ 2 (s) 


2 (s) a T [_ 1 (s) 


W m { s )( s + Ao) 

Ap(s) 


-y P 


(by taking A(s,t) = Q(s,t), B(s,t) = Rp(s,t),f = jyV P )- Using (6.8.32), 
(6.8.33) in the expression for e/ and noting that 


Wm(s)(s + Aq) 


, kk m (s)(s + Aq) 


Rp(s,t) " m 7Va' ''"' yp - z p (s,t) '- ,n 7 v ;^' - v ' Up = -^w m (s)(s + x 0 )<f> 


A P (s) 


A p (s) 


we have 


e f — j— | — 9jW m (s)(s + Aq)</> + e/| 


(6.8.34) 


where 9 P , (f> are as defined in Table 6.9 and 


A 


e f = q -D n ._ 2 (s) I a n ._ 2 (s) 


«!(*) 


lU m (s)(s + Ao) 
Ap(s) 


^I-i(s) 


W / m(s)(s + Ao) 
Ap(s) 


-y P 



■ ^ 


bp 


*0 


(6.8.35) 


Using the normalizing properties of to/ and the fact that y p = W m {s){r + M0 T ui) 
and 6 £ Coo, we have 

ll e /ll < c\\O p m s \\ 

Applying Swapping Lemma A.l and using 9j<f> = —em 2 , we can write 


9lW m (s)(s + A 0 )</> = -W m {s){s + Ao)cto 2 - W c [W b <t> T ]9 P 


(6.8.36) 
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Again using the normalizing properties of m/, it follows from (6.8.34), (6.8.36) that 


||e/|| < c(||emm/|| + ||0 p m/||) (6.8.37) 


for some constant c. Combining (6.8.31), (6.8.37), we have 

||0 T w|| < — mf + — ||0m/|| + oafi \\9mf\\+caQ (||emm/|| + ||0 p to/||) (6.8.38) 

c*o ao 

From (6.6.25), we can establish, using 9 P € Coo, jr e £oo,0 P € £ 2 , that 9 € £00 
and 9 g £ 2 . Hence, 


\\0 T u\\ < 


c 

a 0 


m f +c\\gm f \\ 


where g 2 = A 7 \q \ 2 _|_ q - 2 n (|f)| 2 _|_ e 2 m 2 + \9 p \ 2 ) and g g £ 2 . 

Step 3. Use the B-G Lemma to establish boundedness. This step is identical 
to Step 3 in the proof of stability for the direct MRAC scheme and is omitted. 


Step 4. Convergence of the tracking error to zero. From 


l l P = Wm{s){r - 


-9 t uj) 


and 9 t uj = —9 T (v v , we have 

c 0 V' 

ei = \w m (s)d T u = W m (s)— 9 t uj p 

c 0 Co 

= -W m {8)Fu p -W c {s) (V b (s)0 T a; p )^ (6.8.39) 

co V c o / 

where the last equality is obtained by applying the Swapping Lemma A.l. Substi- 
tuting 9ju> from (6.8.29) in (6.8.39), we have 

ei = — (e f + W c (s)(W b (s)uJ )d \ - W c (s ) ( (W b (s)6 T 

Co V 7 \ c 0 y 

Note from (6.8.35) and a p ,b p g £ 2 , u p ,y p € Coo that g £ 2 . From em,9 p g £ 2 
and the boundedness of J- , m and </>, it also follows from (6.8.34), (6.8.37) that 
e/ g £ 2 . From e/,0, Co G £ 2 , it follows that ei g £ 2 which together with e.\ g £00 
imply that ei(t) — > 0 as t — > 00 . □ 
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6.9 Problems 


6.1 Consider the first order plant 


V = 


b 

s — 


1 


u 


where b > 0 is the only unknown parameter. Design and analyze a direct 
MR.AC scheme that can stabilize the plant and force y to follow the output 
y rn of the reference model 

2 


for any bounded and continuous reference signal r. 


6.2 The dynamics of a throttle to speed subsystem of a vehicle may be represented 
by the first-order system 

b 

V= 9 + d 

s + a 

where V is the vehicle speed, 6 is the throttle angle and d is a constant load 
disturbance. The parameters b > 0, a are unknown constants whose values 
depend on the operating state of the vehicle that is defined by the gear state, 
steady-state velocity, drag, etc. We would like to design a cruise control 
system by choosing the throttle angle 9 so that V follows a desired velocity 
V m generated by the reference model 


Vm 


0.5 

s + 0.5 


V a 


where V s is the desired velocity set by the driver. 


(a) Assume that a, b , and d are known exactly. Design an MRC law that 

meets the control objective. 

(b) Design and analyze a direct MRAC scheme to be used in the case of a, b , 

and d (with b > 0) being unknown. 

(c) Simulate your scheme in (b) by assuming V s = 35 and using the following 

values for a, b, and d: (i) a = 0.02,6 = 1 . 3 , d = 10; (ii) a = 0.02(2 + 
sin O.Olf), 6 = 1.3, d = 10 sin 0.026. 

6.3 Consider the adaptive tracking problem discussed in Section 6.2.2, i.e., the 
output y of the plant 

x = ax + bu 
V = x 

is required to track the output y rn of the reference model 

X'm = O'm^rn T 6 m T, y m — X m 

where a and 6 are unknown constants with 6 ^ 0 and sign(6) unknown. Design 
an adaptive controller to meet the control objective. 
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6.4 Consider the following plant 

x = — x + bu 

where b yf 0 is unknown. Design an adaptive controller that will force x to 
track x m , the state of the reference model 




for any bounded reference input r. 


6.5 Consider the following SISO plant 


, Zp 
y P = k, 


(s). 


R p (s) 


where k p , Z p (s), and R p (s) are known. Design an MRC scheme for y p to track 
y m generated by the reference model 


Z m {s) 

R m (s ) ? 


where R m .(s) has the same degree as R p (s). Examine stability when Z p (s) is 
Hurwitz and when it is not. Comment on your results. 


6.6 Consider the third order plant 


y P = 


k p {s + b 0 ) 


s 6 + a 2 s^ + ais + a 0 


where a,, * = 0, 1, 2; bo, k p are constants and bo > 0. The transfer function of 
the reference model is given by 


1 


Urn — 


(s + l) 5 


(a) Assuming that di,b 0 , and k p are known, design an MRC law that guar- 

antees closed- loop stability and y p — * y m as t — > oo for any bounded 
reference input r. 

(b) Repeat (a) when cii,bo, and k p are unknown and k p > 0. 

(c) If in (b) a 2 = 0, a\ = 0, ao = 1 are known but k p , bo are unknown, 

indicate the simplification that results in the control law. 


6.7 Show that the MRC law given by (6.3.29) in Remark 6.3.6 meets the MRC 

objective for the plant given by (6.3.1). 

6.8 Show that the MRC law given by (6.3.27) or (6.3.28) in Remark 6.3.5 meets the 

MRC objective for the plant (6.3.1) for any given nonzero initial conditions. 
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6.9 Repeat the proof of Theorem 6.4.2 by using a minimal state space representa- 
tion of the error system ei = W m (s) (s + po) p* 0 T <t> as explained in Remark 
6.4.5. 


6.10 Consider the third-order plant 


Up = 


1 


ip 


s° + a 2 s * + a\s + a 0 
where at,i = 0,1,2 are unknown constants and the reference model 

2 

Urn — 


(s + l)(s + 2)(s + 2.5) 

(a) Design a direct MRAC law with unnormalized adaptive law so that all 

signals in the closed-loop plant are bounded and y p (t) — > y m (t) as t — > oo 
for any bounded reference input r. 

(b) Simulate your scheme by assuming the following values for the plant 
parameters 

ao = 1, Oi = —1.5, <22 = 0 

and examine the effect of your choice of r on parameter convergence. 

6.11 Design and analyze a direct MRAC with normalized adaptive law for the 
plant 

b 

Up — i u p 


where b > 0.5, a are unknown constants. The reference model is given by 

3 

Urn — 


S 3 

(a) Design a direct MRAC scheme based on the gradient algorithm 

(b) Repeat (a) for a least-squares algorithm 

(c) Repeat (a) using the SPR.-Lyapunov design approach with normalization 

(d) Simulate your design in (c) with and without normalization. For simu- 
lations use b = 1.2, a = — 1 and r a signal of your choice. 

6.12 Repeat Problem 6.11(a), for a hybrid MRAC scheme. Simulate your scheme 
using the values of5=1.2,a=— 1 and r a signal of your choice. 


6.13 Consider the plant 


_ (s + fro) 

Up 7 . Y2 U P 

(s + ay 


where 6o > 0.2 and a are unknown constants. The reference model is given 

by 

1 

Um — 


5+1 
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(a) Design a direct MRAC scheme with unnormalizecl adaptive law 

(b) Repeat (a) with a normalized adaptive law 

(c) Design an indirect MRAC scheme with an unnormalized adaptive law 

(d) Repeat (c) with a normalized adaptive law 

(e) Simulate one direct and one indirect MRAC scheme of your choice from 

(a) to (d) and compare their performance when bo = 2, a = —5, a nd 
r = 2 + sin0.8t. Comment. 

6.14 Consider the control law (6.3.27) for the plant (6.3.1) where Q*,i = 1,2,3 , cj$ 
are the desired controller parameters. Design and analyze a direct MRAC 
scheme based on this control law. 

6.15 Consider the control law given by equation (6.3.29) in Remark 6.3.6 designed 
for the plant (6.3.1). Design and analyze a direct MRAC scheme based on 
this control law. 


6.16 Consider the SISO plant 


Up 


kp Z p (s( 
R p (s) 


u p 


where Z p (s), R p (s) are monic, Z p (s) is Hurwitz and the relative degree 
n* = 1. The order n of R p is unknown. Show that the adaptive control law 


Up = - 9y p sgn(kp), 6 = y% 


guarantees signal boundedness and convergence of y p to zero for any finite n. 
(Hint: The plant may be represented as 


x\ = Auxi + A 12 y p , Xi £ lZ n 1 
y P = A 2 \X\ + aoyp + kpiip 


where A n is stable.) 


6.17 Following the same procedure used in Section 6.6.2, derive an indirect MRAC 
scheme using unnormalized adaptive laws for a plant with n* = 2. 


6.18 Let oj € Coo be PE and e £ S(y) (~) ^oo where y > 0. Let = oj + e. Show 
that there exists a y* > 0 such that for any y £ [0,/C), is PE. 

6.19 Consider the MRAC problem of Section 6.4.1. It has been shown (see Remark 
6.4.3) that the nonzero initial condition appears in the error equation as 


ei = W m (s)p*(u p - 9* t co) + Cj (si - A c ) 1 e(0) 
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Show that the same stability results as in the case of e(0) = 0 can be estab- 
lished when e(0) ^ 0 by using the new Lyapunov-like function 


V = 


e T P c e 

2 


e T v~ l e 

2 


|p* | +/3eo P 0 e 0 


where eo is the zero-input response, i.e., 


e 0 = A c e o, e 0 (0) = e(0) 

P 0 satisfies Aj P 0 + PqA c = — I and (3 > 0 is an arbitrary positive constant. 



Chapter 7 

Adaptive Pole Placement 
Control 

7.1 Introduction 

In Chapter 6 we considered the design of a wide class of MRAC schemes for 
LTI plants with stable zeros. The assumption that the plant is minimum 
phase, i.e., it has stable zeros, is rather restrictive in many applications. For 
example, the approximation of time delays often encountered in chemical 
and other industrial processes leads to plant models with unstable zeros. As 
we discussed in Chapter 6, the minimum phase assumption is a consequence 
of the MRC objective that requires cancellation of the plant zeros in an 
effort to make the closed-loop plant transfer function equal to that of the 
reference model. The same assumption is also used to express the desired 
controller parameters in the form of a linear or bilinear parametric model, 
and is, therefore, crucial for parameter estimation and the stability of the 
overall adaptive control scheme. 

Another class of control schemes that is popular in the known parameter 
case are those that change the poles of the plant and do not involve plant 
zero-pole cancellations. These schemes are referred to as pole placement 
schemes and are applicable to both minimum and nonminimum phase LTI 
plants. The combination of a pole placement control law with a parameter 
estimator or an adaptive law leads to an adaptive pole placement control 
(APPC) scheme that can be used to control a wide class of LTI plants with 
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unknown parameters. 

As in the MRAC case, the APPC schemes may be divided into two 
classes: The indirect APPC schemes where the adaptive law generates on- 
line estimates of the coefficients of the plant transfer function that are then 
used to calculate the parameters of the pole placement control law by solving 
a certain algebraic equation; and the direct APPC where the parameters of 
the pole placement control law are generated directly by an adaptive law 
without any intermediate calculations that involve estimates of the plant 
parameters. 

The direct APPC schemes are restricted to scalar plants and to spe- 
cial classes of plants where the desired parameters of the pole placement 
controller can be expressed in the form of the linear or bilinear parametric 
models. Efforts to develop direct APPC schemes for a general class of LTI 
plants led to APPC schemes where both the controller and plant parameters 
are estimated on-line simultaneously [49, 112], leading to a rather complex 
adaptive control scheme. 

The indirect APPC schemes, on the other hand, are easy to design and 
are applicable to a wide class of LTI plants that are not required to be 
minimum phase or stable. The main drawback of indirect APPC is the 
possible loss of stabilizability of the estimated plant based on which the 
calculation of the controller parameters is performed. This drawback can 
be eliminated by modifying the indirect APPC schemes at the expense of 
adding more complexity. Because of its flexibility in choosing the controller 
design methodology (state feedback, compensator design, linear quadratic, 
etc.) and adaptive law (least squares, gradient, or SPR-Lyapunov type), 
indirect APPC is the most general class of adaptive control schemes. This 
class also includes indirect MRAC as a special case where some of the poles 
of the plant are assigned to be equal to the zeros of the plant to facilitate 
the required zero-pole cancellation for transfer function matching. Indirect 
APPC schemes have also been known as self-tuning regulators in the litera- 
ture of adaptive control to distinguish them from direct MRAC schemes. 

The chapter is organized as follows: In Section 7.2 we use several ex- 
amples to illustrate the design and analysis of APPC. These examples are 
used to motivate the more complicated designs in the general case treated in 
the rest of the chapter. In Section 7.3, we define the pole placement control 
(PPC) problem for a general class of SISO, LTI plants and solve it for the 
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case of known plant parameters using several different control laws. The 
significance of Section 7.3 is that it provides several pole placement control 
laws to be used together with adaptive laws to form APPC schemes. The 
design and analysis of indirect APPC schemes for a general class of SISO, 
LTI plants is presented in Section 7.4. Section 7.5 is devoted to the design 
and analysis of hybrid APPC schemes where the estimated plant parameters 
are updated at distinct points in time. The problem of stabilizability of the 
estimated plant model at each instant of time is treated in Section 7.6. A 
simple example is first used to illustrate the possible loss of stabilizability 
and a modified indirect APPC scheme is then proposed and analyzed. The 
modified scheme is guaranteed to meet the control objective and is therefore 
not affected by the possible loss of stabilizability during parameter estima- 
tion. Section 7.7 is devoted to stability proofs of the various theorems given 
in previous sections. 

7.2 Simple APPC Schemes 

In this section we use several examples to illustrate the design and analysis of 
simple APPC schemes. The important features and characteristics of these 
schemes are used to motivate and understand the more complicated ones to 
be introduced in the sections to follow. 

7.2.1 Scalar Example: Adaptive Regulation 

Consider the scalar plant 

y = ay + bu (7-2.1) 

where a and b are unknown constants, and the sign of b is known. The control 
objective is to choose u so that the closed-loop pole is placed at — a m , where 
a m > 0 is a given constant, y and u are bounded, and y(t) converges to zero 
as t — » oo. 

If a and b were known and b ^ 0 then the control law 


u = —ky + r 


(7.2.2) 


k 


a a^Yi 


b 


(7.2.3) 
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where r is a reference input, would lead to the closed-loop plant 


y = ~a m y + br (7.2.4) 

i.e. , the control law described by (7.2.2) and (7.2.3) changes the pole of the 
plant from a to — a m but preserves the zero structure. This is in contrast to 
MRC, where the zeros of the plant are canceled and replaced with new ones. 
It is clear from (7.2.4) that the pole placement law (7.2.2) and (7.2.3) with 
r=0 meets the control objective exactly. 

Let us now consider the case where a and b are unknown. As in the 
MRAC case, we use the certainty equivalence approach to form adaptive 
pole placement control schemes as follows: We use the same control law as 
in (7.2.2) but replace the unknown controller parameter k with its on-line 
estimate k. The estimate k may be generated in two different ways: The 
direct one where k is generated by an adaptive law and the indirect one 
where k is calculated from 

k = HP (7.2 

b 

provided b =/=■ 0 where a and b are the on-line estimates of a and b, respec- 
tively. We consider each design approach separately. 


Direct Adaptive Regulation In this case the time-varying gain k in the 
control law 

u = —ky + r, r = 0 (7.2.6) 

is updated directly by an adaptive law. The adaptive law is developed as 
follows: We add and subtract the desired control input, u* = —ky + r with 
k = a+ £ m in the plant equation, i.e., 

y = ay + bu* — bu* + bu = —a m y — b(k — k)y + br 

to obtain, for r = 0, the error equation 

V = -a m y ~ bky (7.2.7) 

where k = k — k, that relates the parameter error term bky and regulation 
error y through the SPR transfer function — ! - — . As shown in Chapter 4, 

S-pCLtn 
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(7.2.7) is in the appropriate form for applying the Lyapunov design approach. 
Choosing 


V = 


y 

2 


+ 


k 2 | b 

^7 


it follows that for 


k = 7 £/ 2 sgn( 6 ) 


(7.2.8) 


we have 

V = - a m y 2 < 0 

which implies that y,k,u G and y G £ 2 - From (7.2.7) and y,k G £oo we 
have y G £oo; therefore, using Lemma 3.2.5 we obtain y(i) — > 0 as t — > 00 . 

In summary, the direct APPC scheme in (7.2.6) and (7.2.8) guaran- 
tees signal boundedness and regulation of the plant state y(t) to zero. The 
scheme, however, does not guarantee that the closed-loop pole of the plant 
is placed at —a m even asymptotically with time. To achieve such a pole 
placement result, we need to show that k — > a+ ^ m as t — > 00 . For parameter 
convergence, however, y is required to be PE which is in conflict with the 
objective of regulating y to zero. The conflict between parameter identifica- 
tion and regulation or control is well known in adaptive control and cannot 
be avoided in general. 


Indirect Adaptive Regulation In this approach, the gain k(t) in the 
control law 

u = —k(t)y + r, r = 0 (7.2.9) 

is calculated by using the algebraic equation 

£ = 0+0 ^ 
b 

with b 7 ^ 0 and the on-line estimates a and b of the plant parameters a and 
b, respectively. The adaptive law for generating a and b is constructed by 
using the techniques of Chapter 4 as follows: 

We construct the series-parallel model 


Vm = -a m {y m -y) + ay + bu 


(7.2.11) 
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then subtract ( 7 . 2 . 11 ) from the plant equation ( 7 . 2 . 1 ) to obtain the error 
equation 

e = — a m e — ay — bu ( 7 . 2 . 12 ) 

where e = y — y m , a = a — a, b = b — b. Using the Lyapunov-like function 

_ e 2 a 2 6 2 

= Y + 2^ + 2^ 

for some 71 , 72 > 0 and choosing 

a = 71 ey, b = 72 eu ( 7 . 2 . 13 ) 


we have 

V = —a m e 2 < 0 

which implies that e, a, b G £00 and e G £2- These properties, however, 
do not guarantee that 6(f) / 0 Vf > 0 , a condition that is required for fc, 
given by ( 7 . 2 . 10 ), to be finite. In fact, for k to be uniformly bounded, we 
should have |6(f)| > bo > 0 Vf > 0 and some constant 60 • Because such a 
condition cannot be guaranteed by the adaptive law, we modify ( 7 . 2 . 13 ) by 
assuming that \b\ > bo > 0 where 60 and sgn(6) are known a priori, and use 
the projection techniques of Chapter 4 to obtain 


a = 7iey ( 7 . 2 . 14 ) 

^ _ I 72 eu if |6| > 60 or if |6| = 60 and sgn(6)eri > 0 
| 0 otherwise 

where 6 ( 0 ) is chosen so that 6 ( 0 )sgn( 6 ) > 6o- The modified adaptive law 
guarantees that \b(t)\ > 60 Vf > 0 . Furthermore, the time derivative V of V 
along the solution of ( 7 . 2 . 12 ) and ( 7 . 2 . 14 ) satisfies 


V = 


— a m e 2 if |6| > 60 or if |6| = 60 and sgn(6)ew > 0 

—a m e 2 — ben if |6| = 6 q and sgn(6)ett < 0 


Now for |6| 
beu - 


- bo and sgn(6)ert < 0, since |6| > bo, we have 

beu — beu = (|6| — |6|)eu sgn(6) = (6q — \b\)eu sgn(6) > 0 


V < —a m e 2 


therefore. 
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Hence, e,a,b G £oo;e G £2 and \b(t)\ > bo Vf > 0, which implies that 
k E £ 00 . Substituting for the control law (7.2.9) and (7.2.10) in (7.2.11), 
we obtain = —a m y m , which implies that y rn E £ 00 , y m (i) — > 0 as f — > 00 
and, therefore, y,u E £oo. From (7.2.12), we have e G £oo ; which, together 
with e G £ 2 , implies that e(t) = y(t) — y m (t) — > 0 as t —> 00 . Therefore, it 
follows that y(t) = e(t) + y m (t) — > 0 as t — ► 00 . 

The indirect adaptive pole placement scheme given by (7.2.9), (7.2.10), 
and (7.2.14) has, therefore, the same stability properties as the direct one. 
It has also several differences. The main difference is that the gain k is 
updated indirectly by solving an algebraic time varying equation at each 
time t. According to (7.2.10), the control law (7.2.9) is designed to meet the 
pole placement objective for the estimated plant at each time t rather than 
the actual plant. Therefore, for such a design to be possible, the estimated 
plant has to be controllable or at least stabilizable at each time t, which 
implies that \b(t)\ 7 ^ 0 Vt > 0. In addition, for uniform signal boundedness, b 
should satisfy \b(t)\ > 60 > 0 where 60 is a lower bound for | 6 | that is known 
a priori. 

The fact that only the sign of b is needed in the direct case, whereas the 
knowledge of a lower bound 60 is also needed in the indirect case motivated 
the authors of [46] to come up with a modified indirect scheme presented in 
the next section where only the sign of b is needed. 

7.2.2 Modified Indirect Adaptive Regulation 

In the indirect case, the form of the matching equation 

cl bk — 

is used to calculate k from the estimates of a, b by selecting k to satisfy 

CL bk — CLyyi 


as indicated by (7.2.10). 

If instead of calculating k(t) we update it using an adaptive law driven 
by the error 

e c = a(t ) - b(t)k(t ) + a m 

then we end up with a modified scheme that does not require the knowledge 
of a lower bound for \b\ as shown below. The error e c is motivated from the 
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fact that as e c — ► 0 the estimated closed- loop pole a(t) — b(t)k(t) converges 
to — a m , the desired pole. The adaptive law for k is developed by expressing 
e c in terms of k,a,b by using a m = bk — a and adding and subtracting the 
term bk, i.e. , 

e c = a — bk + bk — a + bk — bk = a — bk — bk 


The adaptive law for k is then obtained by using the gradient method to 
. . . £ 2 ~ . 
minimize -# w.r.t. k, i.e., 


~k = - 

^2 Qk 


lbe c 


where 7 > 0 is an arbitrary constant. Because b = |6|sgn(6) and 7 is arbi- 
trary, it follows that 

k = k = 7 0 e c sgn(&) 

for some other arbitrary constant 70 > 0 . To assure stability of the overall 
scheme, the adaptive laws for a, b in ( 7 . 2 . 13 ) are modified as shown below: 


x- z. 1 del 

a = a = 71 ey - 7l - — = 71 ey - 7i£ c 

2 oa 

i r 1 del f 

b = b = 7 2 eu - 72 - — = 7 2 e« + 7 2 ke c 

2 db 


The overall modified indirect scheme is summarized as follows: 


u = ~ky 
k = 7 0 £ c sgn (6) 
a = 7 x ey - 7i£ c 
b = 72 eu + j2ke c 
e c = a - bk + a m , e = y - y m 
y m = -a m (y m ~y) + ay + bu 


Stability Analysis Let us choose the Lyapunov-like function 



442 


CHAPTER 7. ADAPTIVE POLE PLACEMENT CONTROL 


where e = y — y m satisfies the error equation 

e = —a m e — ay — bu (7.2.12) 

The time-derivative V of V along the trajectories of the overall system 
is given by 

V = —a m e 2 — £ c {a — bk — bk) = — a m e 2 — e 2 < 0 

which implies that e,a,b,k G Coo\ e, e c G £ 2 - The boundedness of y m and, 
therefore, of y. u is established as follows: The series-parallel model equation 
(7.2.11) can be rewritten as 

y m = -a m (ym -y) + (a- bk)y = -(a m - e c )y m + £ c e 

Because e, £ c G Coo fj £ 2 , it follows that y m G Coo and, therefore, y,uG Coo- 
From i c ,e G £00 and e,£ c G £ 2 , we have e(t),£ c (t) — > 0 as t — > 00 , which 
imply that y m (t) — > 0 as t — > 00 . Therefore, y(t) — > 0 as t — > 00 . 

The modified indirect scheme demonstrates that we can achieve the same 
stability result as in the direct case by using the same a priori knowledge 
about the plant, namely, the knowledge of the sign of b. For the modified 
scheme, the controller parameter k is adjusted dynamically by using the 
error £ c between the closed-loop pole of the estimated plant, i.e., a — bk 
and the desired pole — a m . The use of £ c introduces an additional gradient 
term to the adaptive law for a, b and increases the complexity of the overall 
scheme. In [46] it was shown that this method can be extended to higher 
order plants with stable zeros. 

7.2.3 Scalar Example: Adaptive Tracking 

Let us consider the same plant (7.2.1) as in Section 7.2.1, i.e., 

y = ay + bu 

The control objective is modified to include tracking and is stated as follows: 
Choose the plant input u so that the closed-loop pole is at — a m \ u, y G £00 
and y(t) tracks the reference signal y m {t ) = c, Vf > 0, where c / 0 is a finite 
constant. 

Let us first consider the case where a and b are known exactly. It follows 
from (7.2.1) that the tracking error e = y — c satisfies 


e = ae + ac+ bu 


(7.2.15) 
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Because a, 6, and c are known, we can choose 

u = —k\e — fc 2 (7.2.16) 

where 

a + a m ac 

h = —^, h = T 

(provided b / 0) to obtain 

e = —a m e (7.2.17) 

It is clear from (7.2.17) that e(t) = y(t ) — y rn — > 0 exponentially fast. 

Let us now consider the design of an APPC scheme to meet the control 
objective when a and b are constant but unknown. The certainty equivalence 
approach suggests the use of the same control law as in (7.2.16) but with 
k\ and hi replaced with their on-line estimates k±(t) and ki(t). respectively, 
i.e., 

u = — k\(t)e — kiit) (7.2.18) 

As in Section 7.2.1, the updating of k\ and hi may be done directly, or 
indirectly via calculation using the on-line estimates a and b of the plant 
parameters. We consider each case separately. 

Direct Adaptive Tracking In this approach we develop an adaptive 
law that updates k\ and ki in (7.2.18) directly without any intermediate 
calculation. By adding and subtracting the desired input u* = — k\e — ki in 
the error equation (7.2.15), we have 

e = ae + ac + b(—k\e — ki) + b(k\e + ki) + bu 
= -a m e + b(u + k±e + fc 2 ) 

which together with (7.2.18) imply that 

e = — a m e — b(k\e + fc 2 ) (7.2.19) 

where k\ = k\—k\, fc 2 = fc 2 — fc 2 . As in Chapter 4, equation (7.2.19) motivates 
the Lyapunov-like function 

r e 2 k\\b\ P 2 \b\ 

2 2 7l 2 72 


(7.2.20) 
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whose time-derivative V along the trajectory of (7.2.19) is forced to satisfy 


V — O' 777,6 


(7.2.21) 


by choosing 

k\ = 7ie 2 sgn(6), k 2 = 72esgn (b) (7.2.22) 

From (7.2.20) and (7.2.21) we have that e,ki,k 2 G Coo and e G £2, which, 
in turn, imply that y, u G Coo and e(t) — > 0 as t — > 00 by following the usual 
arguments as in Section 7.2.1. 

The APPC scheme (7.2.18), (7.2.22) may be written as 


u 

k\ 


-kie — 72Sgn (b) 



7ie 2 sgn (b) 


(7.2.23) 


indicating the proportional control action for stabilization and the integral 
action for rejecting the constant term ac in the error equation (7.2.15). We 
refer to (7.2.23) as the direct adaptive proportional plus integral (PI) con- 
troller. The same approach may be repeated when y m is a known bounded 
signal with known y m G Coo- The reader may verify that in this case, the 
adaptive control scheme 

u = -kie- k 2 y m ~ fay m 

k± = 7ie 2 sgn(6) (7.2.24) 

k '2 = 7 2 ey m sgn(6), k 3 = 7 3 ey m sgn(6) 

where e = y — y m , guarantees that all signals in the closed- loop plant in 
(7.2.1) and (7.2.24) are bounded, and y(t) — > y m [t) as t — > 00. 


Indirect Adaptive Tracking In this approach, we use the same control 
law as in the direct case, i.e., 


u = —kie — k /2 

but with k\ , k /2 calculated using the equations 


ki = 


a A a r 


1 ac 

k2 = T 


(7.2.25) 


b 


(7.2.26) 
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provided b ^ 0, where a and b are the on-line estimates of the plant param- 
eters a and b, respectively. 

We generate a and b using an adaptive law as follows: We start with the 
series-parallel model 


e m = ~a m (e m - e) + a(e + c) + bu 


A 


based on the tracking error equation (7.2.15) and define the error eo = e—e m , 
which satisfies the equation 


eo = —a m e o — a(e + c) — bu 


(7.2.27) 


The following adaptive laws can now be derived by using the same approach 
as in Section 7.2.1: 

a = 7 ie 0 (e + c)= 7 i e 0 y (7.2.28) 

y _ j 72 eo u if \'b\ > bo or if \'b\ = bo and sgn(6)eo« > 0 


0 


otherwise 


where 6(0) satisfies 6(0)sgn(6) > bo- The reader may verify that the time 
derivative of the Lyapunov-like function 




2 

2yi 272 


along the trajectories of (7.2.27), (7.2.28) satisfies 

i L Q-m e 0 

which implies eo,a,b E Coo and eo E £2- Because of (7.2.25) and (7.2.26), 
we have e m = —a m e m , and, thus, e m E Coo and e m {t) — > 0 as t — ► 00 . 
Therefore, we conclude that e,u and eo E Coo- From eo E C2,eo E Coo 
we have eo(i) — > 0 as t — ► 00 , which implies that e(t) — > 0 as t — ► 00 . As 
in Section 7.2.2, the assumption that a lower bound bo for 6 is known can 
be relaxed by modifying the indirect scheme (7.2.25) to (7.2.28). In this 
case both k\ and k /2 are adjusted dynamically rather than calculated from 
(7.2.26). 


Example 7.2.1 Adaptive Cruise Control Most of today’s automobiles are 
equipped with the so-called cruise control system. The cruise control system is 



446 


CHAPTER 7. ADAPTIVE POLE PLACEMENT CONTROL 


responsible for maintaining a certain vehicle speed by automatically controlling the 
throttle angle. The mass of air and fuel that goes into the combustion cylinders and 
generates the engine torque is proportional to the throttle angle. The driver sets 
the desired speed Vd manually by speeding to Vd and then switching on the cruise 
control system. The driver can also set the speed Vd by using the “accelerator” 
button to accelerate from a lower speed to Vd through the use of the cruise control 
system. Similarly, if the driver interrupts a previously set speed by using the brake, 
the “resume” button may be used to allow the cruise control system to accelerate 
to the preset desired speed. Because of the changes in the dynamics of vehicles 
that are due to mechanical wear, loads, aerodynamic drag, etc., the use of adaptive 
control seems to be attractive for this application. 

The linearized model of the throttle system with throttle angle 9 as the input 
and speed V s as the output is of the form 


V s = -aV s + b6 + d (7.2.29) 

where a and b are constant parameters that depend on the speed of the vehicle, 
aerodynamic drag and on the type of the vehicle and its condition. The variable d 
represents load disturbances resulting from uphill situations, drag effects, number 
of people in the vehicle, road condition, etc. The uncertainty in the values of a, 6, 
and d makes adaptive control suitable for this application. 

Equation (7.2.29) is of the same form as equation (7.2.15); therefore, we can 
derive a direct adaptive PI control scheme by following the same procedure, i.e. , 
the adaptive cruise control system is described by the following equations: 

8 = —k\V 3 — k 2 

k = 7iK 2 (7.2.30) 

k 2 = 72 Vs 

where V s = V s — Vi and Vd is the desired speed set by the driver and 71 , 72 >0 are 
the adaptive gains. In (7.2.30), we use the a priori knowledge of sgn(fe) > 0 which 
is available from experimental data. 

The direct adaptive cruise control scheme given by (7.2.30) is tested on an actual 
vehicle [92]. The actual response for a particular test is shown in Figure 7.1. 

The design of an indirect adaptive cruise control scheme follows in a similar 
manner by modifying the approach of Section 7.2.3 and is left as an exercise for the 
reader. V 
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7.3 PPC: Known Plant Parameters 

As we demonstrated in Sections 6.6 and 7.2, an indirect adaptive control 
scheme consists of three parts: the adaptive law that provides on-line es- 
timates for the plant parameters; the mapping between the plant and con- 
troller parameters that is used to calculate the controller parameters from 
the on-^i^n^es^^p^^^^ 

The form of the control law and the mapping between plant parameter 
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estimates and controller parameters are the same as those used in the known 
plant parameter case. The purpose of this section is to develop several 
control laws that can meet the pole placement control objective when the 
plant parameters are known exactly. The form of these control laws as well 
as the mapping between the controller and plant parameters will be used 
in Section 7.4 to form indirect APPC schemes for plants with unknown 
parameters. 

7.3.1 Problem Statement 

Consider the SISO LTI plant 

Up = G p {s)u p , G p (s) = |^y (7.3.1) 

where G p (s ) is proper and R p (s) is a rnonic polynomial. The control objec- 
tive is to choose the plant input u p so that the closed-loop poles are assigned 
to those of a given rnonic Hurwitz polynomial A*(s). The polynomial A*(s), 
referred to as the desired closed-loop characteristic polynomial, is chosen 
based on the closed-loop performance requirements. To meet the control 
objective, we make the following assumptions about the plant: 

Pi. R P (s) is a rnonic polynomial whose degree n is known. 

P2. Z p (s),Rp(s) are coprime and degree(Z p ) < n. 

Assumptions (PI) and (P2) allow Z p , R p to be non-Hurwitz in contrast to the 
MRC case where Z p is required to be Hurwitz. If, however, Z p is Hurwitz, 
the MRC problem is a special case of the general pole placement problem 
defined above with A*(s) restricted to have Z p as a factor. We will explain 
the connection between the MRC and the PPC problems in Section 7.3.2. 

In general, by assigning the closed- loop poles to those of A*(s), we can 
guarantee closed-loop stability and convergence of the plant output y p to zero 
provided there is no external input. We can also extend the PPC objective 
to include tracking, where y p is required to follow a certain class of reference 
signals y m . by using the internal model principle discussed in Chapter 3 as 
follows: The reference signal y m G is assumed to satisfy 


Qm(s)ym — 0 


(7.3.2) 
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where Q m (s), the internal model of y m , is a known rnonic polynomial of 
degree q with nonrepeated roots on the jcu-axis and satisfies 

P3. Q m (s), Zp(s) are coprime. 

For example, if y p is required to track the reference signal y m = 2 + sin(2t), 
then Q m (s) = s(s 2 + 4) and, therefore, according to P3, Z p (.s) should not 
have s or s 2 + 4 as a factor. 

The effect of Q m {s) on the tracking error e\ = y p — y m is explained 
in Chapter 3 for a general feedback system and it is analyzed again in the 
sections to follow. 

In addition to assumptions PI to P3, let us also assume that the co- 
efficients of Z p (s), Rp(s), i.e., the plant parameters are known exactly and 
propose several control laws that meet the control objective. The knowledge 
of the plant parameters is relaxed in Section 7.4. 


7.3.2 Polynomial Approach 

We consider the control law 


Q m (s)L(s)u p = —P(s)y p + M(s)y m (7.3.3) 


where P(s), L(s), M(s) are polynomials (with L(s) rnonic) of degree q + n — 
1, n — 1, q + n — 1, respectively, to be found and Q m (s) satisfies (7.3.2) and 


assumption P3. 

Applying (7.3.3) to the plant (7.3.1), we obtain the closed-loop plant 
equation 


Z P M 

Vv ~ LQ m Rp + PZ p Vm 


(7.3.4) 


whose characteristic equation 


LQ m R p + P Z p — 0 


(7.3.5) 


has order 2 n + q — 1. The objective now is to choose P, L such that 


LQ m R p + PZ p — A* (7.3.6) 

is satisfied for a given rnonic Hurwitz polynomial A*(s) of degree 2n + q — 1. 
Because assumptions P2 and P3 guarantee that Q m R p , Z p are coprime, it 
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follows from Theorem 2.3.1 that L, P satisfying (7.3.6) exist and are unique. 
The solution for the coefficients of L(s),P(s ) of equation (7.3.6) may be 
obtained by solving the algebraic equation 


SM = «r 


(7.3.7) 


where Si is the Sylvester matrix of Q m R pi Z p of dimension 2 (n+q) x 2 (n+q) 

ft = [lJ,p T ] T ,ai = \O t *..,0,l,a* T ] T 


l q = [o,...,o,i,Z T ] T eK n+ « 

<? 

I = [ln-2,ln-3, ■ ■ ■ > hJoV € 77." 1 
P = \Pn+q-l,Pn+q-2, ■ ■ ■ ,Pl,Po] T £ 77 n+9 

«* = ten+q-2, «2n +? -3, ■ ■ • , «!, G ft 2 "**" 1 

li,Pi,a* are the coefficients of 

T(s) = s n 1 + l n —2S n 2 + • • • + iis + Zq = 1 + Z~*~a: n — 2(5) 


P(s) = p n+ q_is n+9 1 + p n+? _ 2 s n+<? 2 H \- Pl s + p 0 = p T a n+q -i(s ) 

A*(a) = s 2 ^- 1 +a ^ +9 _ 2S 2n+9 - 2 + - • -+a^. S +aS = S 2 " + «- 1 +a* T « 2 n+? _ 2 (s) 

The coprimeness of Q m R p , guarantees that 5/ is nonsingular; therefore, 

the coefficients of L(s), P(s) may be computed from the equation 

A = Sf'aj 


Using (7.3.6), the closed-loop plant is described by 

Z V M 

Up — A* Dm (7.3.8) 

Similarly, from the plant equation in (7.3.1) and the control law in (7.3.3) 
and (7.3.6), we obtain 

R P M 

u p — ^ v™ 


(7.3.9) 
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Figure 7.2 Block diagram of pole placement control. 


Because y m €E Coo and Zj ^f , are proper with stable poles, it follows 
that y pi u p G Hoo for any polynomial M(s) of degree n + q — 1. Therefore, 
the pole placement objective is achieved by the control law (7.3.3) without 
having to put any additional restrictions on M(s),Q m (s). When y m = 0, 
(7.3.8), (7.3.9) imply that y p ,u p converge to zero exponentially fast. 

When y rn / 0, the tracking error e\ = y p — y m is given by 

Z V M - A* Z v , LR V ^ 

ei = j; ym = (M - P)y m - Q m y m (7.3.10) 

For zero tracking error, (7.3.10) suggests the choice of M(s) = P(s) to null 
the first term in (7.3.10). The second term in (7.3.10) is nulled by using 
QmUm = 0. Therefore, for M(s) = P(s), we have 

*1 = §[ 0 ] - ^[ 0 ] 

Because are proper with stable poles, it follows that ei converges 

exponentially to zero. Therefore, the pole placement and tracking objective 
are achieved by using the control law 

Q m Lu p = —P(y p y m ) (7.3.11) 

which is implemented as shown in Figure 7.2 using n + q — 1 integrators 
to realize C(s) = q y^Z(s ) ~ Because L(s ) is not necessarily Hurwitz, the 
realization of (7.3.11) with n+q — 1 integrators may have a transfer function, 
namely C(s), with poles outside C~ . An alternative realization of (7.3.11) is 
obtained by rewriting (7.3.11) as 

_ A — LQ m P 
u p — u p — — y m ' 


(7.3.12) 
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Figure 7.3 An alternative realization of the pole placement control. 


where A is any rnonic Hurwitz polynomial of degree n+q—1. The control law 
(7.3.12) is implemented as shown in Figure 7.3 using 2 (n + q — 1) integrators 
to realize the proper stable transfer functions — — We summarize the 
main equations of the control law in Table 7.1. 


Remark 7.3.1 The MRC law of Section 6.3.2 shown in Figure 6.1 is a 
special case of the general PPC law (7.3.3), (7.3.6). We can obtain the 
MRC law of Section 6.3.2 by choosing 

Qm = 1, ^ = Z p A 0 R m , M(s ) = - 


L(s) = A(s) - 9* 1 T a n - 2 (s), P(s ) = ~(9* 2 1 a n - 2 (s ) + ^A(s)) 


*T 


A — Ao Z m , y m — k m r 

I'm 


where Z p , Ao, R m are Hurwitz and Ao, R m , are as defined 

in Section 6.3.2. 


Example 7.3.1 Consider the plant 

b 


where a and b are known constants. The control objective is to choose u p such that 
the poles of the closed-loop plant are placed at the roots of A*(s) = (s + l) 2 and 
y p tracks the constant reference signal y m = 1. Clearly the internal model of y m is 
Q m (s) = s, i.e., q = 1. Because n = 1, the polynomials L,P, A are of the form 


L(s) = l, P(s)=pis + p 0 , A = s + X 0 
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Table 7.1 PPC law: polynomial approach 


Plant 

Z p (s) 

Up ~ R p (s) U P 

Reference input 

Qm(s)y m = 0 

Calculation 

Solve for L(s) = s n_1 + l T a n -^s) and 
P(s) =p T a. n +q-i(s) the polynomial equation 
L(s)Q m (s)R p (s) + P(s)Z p (s ) = A*(s) 
or solve for fii the algebraic equation Si/3i = a], 
where Si is the Sylvester matrix of RpQ m , Z p 

A = [iJ,p T ] T € R^ n+q) 
i q = ]o 1 ^ 2 o,i ,i T } T en n+q 


A*(s) = s 2n+q ~ l + a* T a 2n + q - 2 (s) 
of = [0, . . . , 0, 1, a* T ] T € 

<1 

Control law 

U p — A Up A t'i 

= Up Urn 

Design variables 

H*(s) is rnonic Hurwitz; Q m {s ) is a rnonic poly- 
nomial of degree q with nonrepeated roots on ju 
axis; A(s) is a rnonic Hurwitz polynomial of degree 
n + q — 1 


where Ao > 0 is arbitrary and po,Pi are calculated by solving 

s(s + u) + (p is + Po)b — (s + 1)^ (7.3.13) 

Equating the coefficients of the powers of s in (7.3.13), we obtain 

2 — a 

IT 


P 1 = 


1 

Po= b 
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Equation (7.3.13) may be also written in the form of the algebraic equation (7.3.7), 
i.e. , the Sylvester matrix of s(s + a), b is given by 


Si 


10 0 0 
a 1 0 0 

0 a b 0 

0 0 0 6 


and 


h = 


Therefore, the PPC law is given by 

(s + Ao — s) 

Ur, — ' : Ur, 


0 


' 0 ' 

1 

* 

1 

Pi 

i a i — 

2 

. Po . 


1 


S -\- A 


0 


2 — a 1 
“ 6“ S+ 6 


s H- A( 


-ei 


Aq (2 — &)s H- 1 
s + A 0 Up b(s + A 0 ) 61 


where e, = y p — y m = y v — 1. A state-space realization of the control law is given 

by 

4>i = — Ao<£i + Up 

4 > 2 = — Ao</> 2 + ei 

. , 1 — 2Ao + aAo 2 — a 

Up = AQlpi <p 2 — ei 


V 


7.3.3 State- Variable Approach 


An alternative approach for deriving a PPC law is to use a state observer 
and state-variable feedback. 

We start by considering the expression 


ei 


z p(s) 

Rp(s) 


Up ym 


(7.3.14) 


for the tracking error. Filtering each side of (7.3.14) with where Qi(s) 

is an arbitrary rnonic Hurwitz polynomial of degree q. and using Q m {s)y m = 0 


we obtain 


ei 


Z P Q i _ 

— U' 

R P Q m 


(7.3.15) 
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where 

u P = ^ u p (7.3.16) 

With (7.3.15), we have converted the tracking problem into the regulation 
problem of choosing u p to regulate e\ to zero. 

Let (A,B,C) be a state space realization of (7.3.15) in the observer 
canonical form, i.e., 


where 


A = 


e = Ae + Bu p 
ei = C T e 


In+q— 1 


0 


B = e i c=[i,o,...,o] T 


(7.3.17) 


(7.3.18) 


6*, 02 GTZ n+q are the coefficient vectors of the polynomials R p (s)Q m (s ) — s n+q 
and Z p (s)Qi(s), respectively. Because R p Q m ,Z p are coprime, any possible 
zero-pole cancellation in (7.3.15) between Qi(s) and R p (s)Q m (s ) will oc- 
cur in C~ due to Qi(s) being Hurwitz, which implies that ( A,B ) is always 
stabilizable. 

We consider the feedback law 


u p — 7C c e, 


_ Q i _ 

Up — Up 


where e is the state of the full-order Luenberger observer 


(7.3.19) 


e = Ae + Bu p — K 0 (C T e — e±) 

(7.3.20) 

and K c and K 0 are calculated from 


det (sI-A + BK c ) = A* c (s) 

(7.3.21) 

det (si -A + K q C t ) = A* a (s) 

(7.3.22) 


where A* and A* are given rnonic Hurwitz polynomials of degree n + q. The 
roots of A*(s) = 0 represent the desired pole locations of the transfer function 
of the closed-loop plant whereas the roots of H*(s) are equal to the poles of 
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the observer dynamics. As in every observer design, the roots of A*(s) = 0 
are chosen to be faster than those of A*(s) = 0 in order to reduce the effect 
of the observer dynamics on the transient response of the tracking error e\. 
The existence of K c in (7.3.21) follows from the controllability of ( A,B ). 
If (A, B ) is stabilizable but not controllable because of the common factors 
between Qi(s),R p (s), the solution for K c in (7.3.21) still exists provided 
A*(s) is chosen to contain the common factors of Q\,R p . Because A*(s) 
is chosen based on the desired closed-loop performance requirements and 
Qi(s) is an arbitrary rnonic Hurwitz polynomial of degree q, we can choose 
Qi(s) to be a factor of A*(s) and, therefore, guarantee the existence of K c 
in (7.3.21) even when ( A, B ) is not controllable. The existence of K 0 in 
(7.3.22) follows from the observability of (C, A). Because of the special form 
of (7.3.17) and (7.3.18), the solution of (7.3.22) is given by K 0 = — 91 

where a is the coefficient vector of A*(s). 

Theorem 7.3.1 The PPC law (7.3.19) to (7 .3.22) guarantees that all sig- 
nals in the closed-loop plant are bounded and e\ converges to zero exponen- 
tially fast. 

Proof We define the observation error e 0 = e — e. Subtracting (7.3.20) from 
(7.3.17) we have 

e 0 = (A — K 0 C T )e 0 (7.3.23) 

Using (7.3.19) in (7.3.20) we obtain 

S= (A — BK c )e + K 0 C T e a (7.3.24) 

Because A — K a C T ,A — BK C are stable, the equilibrium e oe = 0, e e = 0 of (7.3.23), 
(7.3.24) is e.s. in the large. Therefore e,e a € £oo and e(t),e 0 (t) — > 0 as t — > oo. 
From e 0 = e — e and u p = —K c e, it follows that e, u p , ei £ £oo and eft), u p (t), 
ei(t) — * 0 as t — > oo. The boundedness of y p follows from that of e\ and y rn . We 
prove that u p € £oo as follows: Because of Assumption P3 and the stability of 
Qi(s), the polynomials Z p Q\, R p Q m have no common unstable zeros. Therefore 
there exists polynomials X, Y of degree n + q — 1 with X monic that satisfy the 
Diophantine equation 

RpQrnX + Z p QiY = A* (7.3.25) 

where A* is a monic Hurwitz polynomial of degree 2 (n + q) — 1 that contains the 
common zeros of Qi(s), R p (s)Q m (s). Dividing each side of (7.3.25) by A* and using 
it to filter u p , we obtain the equation 

RpQmX QiYZ p 

— U P + U P = U P 
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Because Q m u p = Q\u p and Z p u p = R p y p , we have 


RpXQi _ 

u p = A * u p 


QiYR p 

-^ y P 


Because the transfer functions operating on u p , y p are proper with poles in C , then 
u p ,y p € C oo imply that u p £ and the proof is complete. □ 


The main equations of the state variable feedback law are summarized 
in Table 7.2. 


Example 7.3.2 Let us consider the same plant and control objective as in Example 
7.3.1, i.e., 

Plant y p = — - — u p 

s + a 

Control Objective Choose u p such that the closed-loop poles are placed at the 
roots of ^4*(s) = (s + l) 2 and y p tracks y m = 1. 


Clearly, the internal model of y rn is Q m ( s ) = s an d the tracking error e\ = 
Up - Vm satisfies 

b 

— ■ Up 

s + a 

Filtering each side of the above equation with i.e., using Qi(s) = s + 1, we 
obtain 

ei = < 7 ’ 3 ’ 26 ) 

(s + a)s 

where u p = yryUp- The state-space realization of (7.3.26) is given by 


e = 


—a 1 

0 0 


bu r 


e\ = [1 0]e 

The control law is then chosen as follows: 


Observer 


Control Law 



r -o 1 1 


' 1 ' 

e = 

o 

o 

i 

e + 

1 


u p — Tf c e, 


bu p — A' 0 ([l 0]e-ei) 


Up — Up + 


( r)di 


where K a = [k 0l , k Q2 ] T , K c = [k Cl ,k C2 ] are calculated using (7.3.21) and (7.3.22). 
We select the closed-loop polynomial ^(s) = (s + l) 2 and the observer polynomial 
T*(s) = (s + 5) 2 and solve 


det (si — A + BK C ) — (s + l) 2 , det (si — A + K 0 C r ) = (s + 5) 2 
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Table 7.2 State-space pole placement control law 


Plant 

Z p (s) 

yp ~ R p (s) u p 

Reference input 

Qm{s)y m = 0 

Observer 

e = Ae+Bup — K 0 (C T e — e\) 

Li+q— 1 

A = -9\ , B = 0* 2 

0 

c = [ 1 , 0 , . . . , 0 ] , ei = y p — Dm 

where 9 \ , 9% € lZ n+q are the coefficient vectors 
of R p (s)Q m (s ) — s n+q and Z p (s)Q\(s), respec- 
tively 

Calculation 

K 0 = Oq — 9\ where is the coefficient vector of 

j4*(s) — s n+q ; K c is solved from 
det(sJ - A + BK C ) = A* c (s ) 

Control law 

Up — K c e, Up — Q m ( s )U p 

Design variables 

A*(s),A*(s) are rnonic Hurwitz polynomials of 
degree n + q\ Qi(s) is a rnonic Hurwitz polyno- 
mial of degree q; A*(s) contains Qi(s) as a fac- 
tor; Q m (s ) is a rnonic polynomial of degree q with 
nonrepeated roots on the joj axis 


where 


A = 


— a 1 
0 0 


B = 


b , C T = [1,0] 


K c =-[l-a,l], K 0 = [ 10 -a, 25] T 


for K 0 ,K C to obtain 
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Note that the solution for K c holds for any a and b ^ 0. For a 
pair 


— a 1 
0 0 


1 and b ^ 0 the 


is uncontrollable due to a zero-pole cancellation in (7.3.26) at s = —1. Because, 
however, A*(s) = (s+1) 2 contains s+1 as a factor the solution of K c still exists. The 
reader can verify that for A*(s) = (s + 2) 2 (i.e., A*(s) cloesnot have Qi(s) = s + 1 
as a factor) and a = 1, no finite value of K c exists to satisfy (7.3.21). V 


7.3.4 Linear Quadratic Control 

Another method for solving the PPC problem is using an optimization tech- 
nique to design a control input that guarantees boundedness and regulation 
of the plant output or tracking error to zero by minimizing a certain cost 
function that reflects the performance of the closed-loop system. As we have 
shown in Section 7.3.3, the system under consideration is 

e = Ae + Blip 

ei = C T e (7.3.27) 

where u p is to be chosen so that the closed-loop system has eigenvalues that 
are the same as the zeros of a given Hurwitz polynomial A*(s). If the state 
e is available for measurement, then the control input 

zip — K c c 

where I\ c is chosen so that A — BK C is a stable matrix, leads to the closed- 
loop system 


e = (A — BK c )e 

whose equilibrium e e = 0 is exponentially stable. The existence of such K c is 
guaranteed provided (A, B ) is controllable (or stabilizable). In Section 7.3.3, 
K c is chosen so that det(s/ — A + BK C ) = A*(s) is a Hurwitz polynomial. 
This choice of K c leads to a bounded input u p that forces e, e\ to converge 
to zero exponentially fast with a rate that depends on the location of the 
eigenvalues of A — BI\ C , i.e., the zeros of A*(s). The rate of decay of e\ to 
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zero depends on how negative the real parts of the eigenvalues of A — BK C 
are. It can be shown [95] that the more negative these values are, the larger 
the value of K c and, therefore, the higher the required signal energy in u p . 
In the limit, as the eigenvalues of A — BK C are forced to — oo, the input u p 
becomes a string of impulsive functions that restore eft) instantaneously to 
zero. These facts indicate that there exists a trade-off between the rate of 
decay of ei, e to zero and the energy of the input u p . This trade-off motivates 
the following linear quadratic control problem where the control input u p is 
chosen to minimize the quadratic cost 

/»oo 

J = J (ef + A Up)dt 

where A > 0, a weighting coefficient to be designed, penalizes the level of the 
control input signal. The optimum control input u p that minimizes J is [95] 

u p = —K c e, K c = A ~ 1 B T P (7.3.28) 

where P = P T > 0 satisfies the algebraic equation 

A T P + PA- PBX~ 1 B T P + CC T =0 (7.3.29) 

known as the Riccati equation. 

Because (A, B ) is stabilizable, owing to Assumption P3 and the fact that 
Qi(s) is Hurwitz, the existence and uniqueness of P = P T > 0 satisfying 
(7.3.29) is guaranteed [95]. It is clear that as A — > 0, a situation known as low 
cost of control , \\K C \ \ — ► oo, which implies that u p may become unbounded. 
On the other hand if A — > oo, a situation known as high cost of control , 
u p — > 0 if the open-loop system is stable. If the open loop is unstable, then 
u p is the one that minimizes /g°° u p dt among all stabilizing control laws. In 
this case, the real part of the eigenvalues of A — BK C may not be negative 
enough indicating that the tracking or regulation error may not go to zero 
fast enough. With A > 0 and finite, however, (7.3.28), (7.3.29) guarantee 
that A — BK C is a stable matrix, e, e\ converge to zero exponentially fast, 
and u p is bounded. The location of the eigenvalues of A — BK C depends on 
the particular choice of A. For a given A > 0, the polynomial 

f(s) = R p {s)Q m (s)R p (-s)Q m (-s) + \~ 1 Zp(s)Qi(s)Z p (-s)Qi(-s) 
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is an even function of s and f(s) = 0 has a total of 2 (n + q) roots with 
n + q of them in C~ and the other n + q in C + . It can be shown that the 
poles corresponding to the closed-loop LQ control are the same as the roots 
of f(s) = 0 that are located in C~ [6]. On the other hand, however, given 
a desired polynomial A*(s), there may not exist a A so that det (si — A + 
B\~ 1 B T P) = A*(s). Hence, the LQ control solution provides us with a 
procedure for designing a stabilizing control input for the system (7.3.27). 
It doesnot guarantee, however, that the closed-loop system has the same 
eigenvalues as the roots of the desired polynomial A*(s). The significance 
of the LQ solution also relies on the fact that the resulting closed-loop has 
good sensitivity properties. 

As in Section 7.3.3, the state e of (7.3.28) may not be available for mea- 
surement. Therefore, instead of (7.3.28), we use 

Up = —K c e, K c = A _1 L> t P (7.3.30) 

where e is the state of the observer equation 

e = Ae + Bu p — K Q (C T e — e±) (7.3.31) 


and K 0 = 
given by 


a* 0 — 91 as in (7.3.20). As in Section 7.3.3, the control input is 


Up — 


Qi(s) - 
Qm(s) Up 


(7.3.32) 


Theorem 7.3.2 The LQ control law (7.3.30) to (7.3.32) guarantees that 
all signals in the closed-loop plant are bounded and e\(t) — > 0 as t — » oo 
exponentially fast. 


Proof As in Section 7.3.3, we consider the system described by the error equations 


e 0 — (A- I\ 0 C T )e 0 
i = (A — BK c )e + K 0 C T e 0 


(7.3.33) 


where K c = A -1 1? T P and K a is chosen to assign the eigenvalues of A — I\ 0 C T to 
the zeros of a given Hurwitz polynomial A*(s). Therefore, the equilibrium e oe = 0, 
e e = 0 is e.s. in the large if and only if the matrix A — BK C is stable. We establish 
the stability of A — BK C by considering the system 


e= (A — BK c )e = (A - A ~ 1 BB r P)e 


(7.3.34) 
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and proving that the equilibrium e e = 0 is e.s. in the large. We choose the Lyapunov 
function 

V(e) = e T Pe 

where P = P T > 0 satisfies the Riccati equation (7.3.29). Then V along the 
trajectories of (7.3.34) is given by 

V = —e T CC T e - A ~ 1 e r PBB r Pe = -( C T e ) 2 - \{B T Pe) 2 < 0 

A 

which implies that e e = 0 is stable, e £ and C T e, B T Pe £ £ 2 . We now rewrite 
(7.3.34) as 

e = (A - K Q C T )e + K Q C T e - \bB t Pe 

A 

by using output injection, i.e., adding and subtracting the term K a C T e. Because 
A — K 0 C t is stable and C T e, B T Pe £ fj £ 2 , it follows from Corollary 3.3.1 that 
e £ £00 Pi £2 and e — > 0 as t —* 00 . Using the results of Section 3.4.5 it follows that 
the equilibrium e e = 0 is e.s. in the large which implies that 

A - BK C = A - A ~ 1 BB t P 

is a stable matrix. The rest of the proof is the same as that of Theorem 7.3.1 and 
is omitted. □ 

The main equations of the LQ control law are summarized in Table 7.3. 
Example 7.3.3 Let us consider the scalar plant 

x = — ax + bu p 

y P = x 

where a and b are known constants and b ^ 0. The control objective is to choose 
Up to stabilize the plant and regulate y p ,x to zero. In this case Q m (s) = Qi(s) = 1 
and no observer is needed because the state x is available for measurement. The 
control law that minimizes 


pOO 

J = / {y 2 p + ^u 2 p )dt 

Jo 


Up = “ bpy P 


is given by 
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Table 7.3 LQ control law 


Plant 

Z p (s) 

yp — r p (s) u p 

Reference input 

Qm{s)y m = 0 

Observer 

As in Table 7.2 

Calculation 

Solve for P = P T > 0 the equation 

A T P + PA - PBX~ 1 B t P + CC T = 0 

A, B,C as defined in Table 7.2 

Control law 

Up — A B Pi. , Up — Q m(s) Up 

Design variables 

A > 0 penalizes the control effort; Qi(s), Q m (s) 
as in Table 7.2 


where p > 0 is a solution of the scalar Riccati equation 

p 2 b 2 

- 2 ap - ' +1 = 0 

A 

The two possible solutions of the above quadratic equation are 


—A a + \J A 2 a 2 + b 2 A — Xa — VX 2 a 2 + b 2 X 

P 1 — p ’ P 2 ~~ p 


It is clear that p\ > 0 and p 2 < 0; therefore, the solution we are looking for is 
p = Pi >0. Hence, the control input is given by 
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that leads to the closed-loop plant 


x = — 


b 2 

a + —x 

A 


which implies that for any finite A > 0, we have x € £oo and x(t) — * 0 as t — ■> oo 
exponentially fast. It is clear that for A — > 0 the closed-loop eigenvalue goes to 
— oo. For A — > oo the closed-loop eigenvalue goes to — |a|, which implies that if 
the open-loop system is stable then the eigenvalue remains unchanged and if the 
open-loop system is unstable, the eigenvalue of the closed-loop system is flipped to 
the left half plane reflecting the minimum effort that is required to stabilize the 
unstable system. The reader may verify that for the control law chosen above, the 
cost function J becomes 


J = A 


b 2 


K ~ a 

-z 2 (0) 


It is clear that if a > 0, i.e., the plant is open-loop stable, the cost J is less than when 
a < 0, i.e., the plant is open-loop unstable. More details about the LQ problem 
may be found in several books [16, 95, 122]. V 


Example 7.3.4 Let us consider the same plant as in Examples 7.3.1, i.e., 

b 

Plant y v = u v 

s + a 

Control Objective Choose u p so that the closed-loop poles are stable and y v 
tracks the reference signal y m = 1. 

Tracking Error Equations The problem is converted to a regulation problem 
by considering the tracking error equation 

b(s + 1) _ _ s 

e i = 7 . u p ~ 

(s + a)s s + 1 

where e\ = y p — y m generated as shown in Example 7.3.2. The state-space repre- 
sentation of the tracking error equation is given by 


e = 


— a 1 
0 0 


bu r 


ei = [1 , 0] e 

Observer The observer equation is the same as in Example 7.3.2, i.e., 


f -a 1 1 


' 1 ' 

o 

o 

e + 

1 


e = 


bu p -K 0 ([ 1 0]e-ei) 
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where K a = [10 — a, 25] T is chosen so that the observer poles are equal to the roots 
of ^o(s) = (s + 5) 2 . 


Control law The control law, according to (7.3.30) to (7.3.32), is given by 


Up = — A 1 [b, b\Pe, 

where P satisfies the Riccati equation 


s + 1 . 


—a 1 

0 0 

T 

P + P 

—a 1 

0 0 

-p 

■ b ' 

b 

A" 1 [5 b\P + 

' 1 O' 
0 0 


= 0 (7.3.35) 


where A > 0 is a design parameter to be chosen. For A = 0.1, a = —0.5, b = 2, the 
positive definite solution of (7.3.35) is 


P = 

which leads to the control law 


0.1585 0.0117 
0.0117 0.0125 


s 1 

Up = —[3.4037 0.4829]e, u p = u p 

This control law shifts the open-loop eigenvalues from Ai = 0.5, A 2 = 0 to Ai = 
-1.01, A 2 = -6.263. 

For A = 1, a = —0.5, b = 2 we have 


P = 


0.5097 0.1046 
0.1046 0.1241 


leading to 


= -[1.2287 0.4574]e 


and the closed-loop eigenvalues Ai = — 1.69,A 2 = —1.19. 

Let us consider the case where a = 1, b = 1. For these values of a, b the pair 


-1 1 

0 0 


is uncontrollable but stabilizable and the open-loop plant has eigenvalues at Ai = 0, 
A 2 = —1. The part of the system that corresponds to A 2 = —1 is uncontrollable. In 
this case, A = 0.1 gives 


0.2114 0.0289 
0.0289 0.0471 


and Up = —[2.402 0.760]e, which leads to a closed-loop plant with eigenvalues at 
—1.0 and —3.162. As expected, the uncontrollable dynamics that correspond to 
A 2 = — 1 remained unchanged. \7 
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7.4 Indirect APPC Schemes 

Let us consider the plant given by (7.3.1), i.e. , 
y P = Gp(s)u p , G p (s) = 

where R p (s), Z p (s ) satisfy Assumptions PI and P2. The control objective is 
to choose u p so that the closed-loop poles are assigned to the roots of the 
characteristic equation A*(s) = 0, where A*(s) is a given nronic Hurwitz 
polynomial, and y p is forced to follow the reference signal y rn G Coo whose 
internal model Q m (s), i.e., 

Qm{s)y m = 0 

is known and satisfies assumption P3. 

In Section 7.3, we assume that the plant parameters (i.e., the coefficients 
of Z p (s),Rp(s)) are known exactly and propose several control laws that meet 
the control objective. In this section, we assume that Z p (s),R p (s) satisfy 
Assumptions PI to P3 but their coefficients are unknown constants; and use 
the certainty equivalence approach to design several indirect APPC schemes 
to meet the control objective. As mentioned earlier, with this approach we 
combine the PPC laws developed in Section 7.3 for the known parameter case 
with adaptive laws that generate on-line estimates for the unknown plant 
parameters. The adaptive laws are developed by first expressing (7.3.1) 
in the form of the parametric models considered in Chapter 4, where the 
coefficients of Z P ,R P appear in a linear form, and then using Tables 4.1 to 

4.5 to pick up the adaptive law of our choice. We illustrate the design of 
adaptive laws for the plant (7.3.1) in the following section. 

7.4.1 Parametric Model and Adaptive Laws 

We consider the plant equation 

Hp ( s ) y p — Z p {s)u p 

wher eRp(s) = s n +a n -is n ~ l -\ bais+ao, Z p (s) = 6„_is n ^ 1 -] Ibis+bo, 

which may be expressed in the form 


[s n + 0* T a n -i(s)]y p = 0l T a n -i(s)u p 


(7.4.1) 
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r _ -rlT 

where 9* x = 6^ T , (9* — A P ) T and \ p = [A n _i, X n - 2 , ■ ■ ■ , Ao] T is the coeffi- 
cient vector of A p (s) — s n . Equation (7.4.3) can also be used to generate a 
wide class of adaptive laws using the results of Chapter 4. 

The plant parameterizations in (7.4.2) and (7.4.3) assume that the plant 
is strictly proper with known order n but unknown relative degree n* > 
1. The number of the plant zeros, i.e., the degree of Z p (s), however, is 
unknown. In order to allow for the uncertainty in the number of zeros, 
we parameterize Z p (s) to have degree n — 1 where the coefficients of s l for 
i = m + l,m + 2,...,n — 1 are equal to zero and m is the degree of Z p (s) . If 
m < n — 1 is known, then the dimension of the unknown vector 9* is reduced 
to n + m + 1. 

The adaptive laws for estimating on-line the vector 9* or 9\ in (7.4.2), 
(7.4.3) have already been developed in Chapter 4 and are presented in Tables 
4.1 to 4.5. In the following sections, we use (7.4.2) or (7.4.3) to pick up 
adaptive laws from Tables 4.1 to 4.5 of and combine them with the PPC 
laws of Section 7.3 to form APPC schemes. 



468 


CHAPTER 7. ADAPTIVE POLE PLACEMENT CONTROL 


7.4.2 APPC Scheme: The Polynomial Approach 

Let us first illustrate the design and analysis of an APPC scheme based on 
the PPC scheme of Section 7.3.2 using a first order plant model. Then we 
consider the general case that is applicable to an nth-order plant. 


Example 7.4.1 Consider the same plant as in example 7.3.1, i.e., 

b 

yp — : u p 

s + a 


(7.4.4) 


where a and b are unknown constants and u p is to be chosen so that the poles of 
the closed-loop plant are placed at the roots of A*(s) = (s + l) 2 = 0 and y p tracks 
the constant reference signal y rn = 1 Vf > 0. 

Let us start by designing each block of the APPC scheme, i.e., the adaptive law 
for estimating the plant parameters a and b ; the mapping from the estimates of a, 
b to the controller parameters; and the control law. 

Adaptive Law We start with the following parametric model for (7.4.4) 


2 = 



where 


s T A 


Vpi 4 1 — 


1 

Up 

/)* 

i — 

' b ' 

s A 

. ~ Vp . 

5 V 

a 


(7.4.5) 


and A > 0 is an arbitrary design constant. Using Tables 4.1 to 4.5 of Chapter 4, we 
can generate a number of adaptive laws for estimating 0*. For this example, let us 
choose the gradient algorithm of Table 4.2 


Op =Te(j> 


(7.4.6) 


z — I 


£ = 


mr = 1 + < 


where T = T t > 0, 0 P = [b, a] T and a(t), b(t) is the estimate of a and b respectively. 

Calculation of Controller Parameters As shown in Section 7.3.2, the control 
law 

A — LQ m P 

u p = u p - -d (7.4.7) 


Up ^ei 


can be used to achieve the control objective, where A(s) = s + A 0 , L(s) = 1, 

Qm{s) = s , P{s) = pis + po,ei = y p — y m and the coefficients pi,po of P(s) satisfy 
the Diophantine equation 


s(s + a) + (pis + p 0 )b = (s + l) 2 


(7.4.8) 
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or equivalently the algebraic equation 


(7.4.9) 


whose solution is 


Po = ~ b 


Because a and b are unknown, the certainty equivalence approach suggests the use of 
the same control law but with the controller polynomial P(s) = piS + po calculated 
by using the estimates a(t) and b(t) of a and b at each time t as if they were the 
true parameters, i.e., P(s, t ) = Pi(t)s + po(t) is generated by solving the polynomial 
equation 

s • (s + a) + (pis +po) ■ b = (s + l) 2 (7.4.10) 

for pi and po by treating a{t) and 6(f) as frozen parameters at each time t, or by 
solving the algebraic time varying equation 


0 0 

1 0 

a(t) b(t) 

0 0 


(7.4.11) 


for po and p\. The solution of (7.4.10), where ci(t) and 6(f) are treated as constants 
at each time t, is referred to as pointwise to distinguish it from solutions that may 
be obtained with s treated as a differential operator, and a(f) and 6(f) treated as 
differentiable functions of time. 

The Diophantine equation (7.4.10) or algebraic equation (7.4.11) has a unique 
solution provided that (s + a), b are coprime, i.e., provided b ^ 0. The solution is 
given by 

„ . . 2 — a , . . 1 

Pi{t) = Po{t) = j 

b b 

In fact for a, b £ C oo to imply that po , pi G Coo , (s+a) , b have to be strongly coprime, 
i.e., |6| > b 0 for some constant bo > 0. For this simple example, the adaptive law 
(7.4.6) can be modified to guarantee that \b(t)\ > b 0 > 0 Vf > 0 provided sgn(6) and 
a lower bound bo of |6| are known as shown in Section 7.2 and previous chapters. 
For clarity of presentation, let us assume that the adaptive law (7.4.6) is modified 
using projection (as in Section 7.2) to guarantee |6(t)| > b 0 Vi > 0 and proceed with 
the rest of the design and analysis. 

Control Law The estimates Po(t),Pi{t) are used in place of the unknown po, p\ 
to form the control law 


A 0 ( ~ / \ s „ . . 1 

u P = , . u p - Pi(f , . +Po(t) 

s + A 0 \ s+A 0 s + A 0 


{Up Dm) 


(7.4.12) 
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Figure 7.4 Block diagram of APPC for a first-order plant. 


where Ao > 0 is an arbitrary design constant. For simplicity of implementation, 
Ao may be taken to be equal to the design parameter A used in the adaptive law 
(7.4.6), so that the same signals can be shared by the adaptive and control law. 

Implementation The block diagram of the APPC scheme with A = Ao for the 
first order plant (7.4.4) is shown in Figure 7.4. 

For A = Ao, the APPC scheme may be realized by the following equations: 

Filters 

4>i = -Xfa + Up, </>i(o) = o 

02 = -A</>2 - y p , 02(0) = 0 
0m = A0 m Urm 0m (0) = 0 
Z = A02 + Up = 02 


Adaptive Law 

b 

a 


( 7 ie 0 i if |S| > b 0 or if |6| = 
\ 0 otherwise 


72 £02 

2 — 601 — <20 2 
in 2 


m 2 = 1 + 0i 


6 0 and e0isgn6 > 0 


+ 0 


e 


'2 
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Control Law 

Up = Mi- p 0 (t))(<l>2 + <l>m) -Pi(t)(y p -y m ) 

2 -a „ 1 

Pi = —j—, Po=j 

b b 

where 71,72, A > 0 are design constants and 6(0)sgn(6) > 6 0 - 


Analysis The stability analysis of the indirect APPC scheme is carried out in the 
following four steps: 

Step 1. Manipulate the estimation error and control law equations to express 
y p , u p in terms of the estimation error e. We start with the expression for the 
normalized estimation error 

em 2 = 2 — bp 1 — d(f>2 = —p2 — bp 1 — cup 2 

which implies that 

p 2 = —b(f> 1 — a<j> 2 — em 2 (7.4.13) 

From the control law, we have 

Up = Xtpi + Pl(<t >2 + 4 >m) + Po 4>2 + Po 4 >m 
Because u p — \(j>i = <fii, it follows that 

<t>l = Pl<p 2 + P 0 <t >2 + Vm 

where y m = p\(j> m + Po 4 >m- Substituting for ip 2 from (7.4.13) we obtain 

Pi = -Pibpi - (pia - p 0 )p2 ~ Piem 2 + y m (7.4.14) 

Equations (7.4.13), (7.4.14) form the following state space representation for the 
APPC scheme: 


where 


x = A(t)x + bi(t)em 2 + 622/m 

'' = x + Xx = (A(t) + XI)x + bi(t)em 2 + b2y m ( 7 . 4 . 15 ) 

~Vp 



-Pib Po-Pid 
—b —a 




m 2 = 1 + x T x and y m £ Coq. 
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Step 2. Show that the homogeneous part of (7.f.l5) is e.s. For each fixed t, 
det(sJ— A(t)) = (s+a)s+fc(pis+po) = (s+1) 2 , he., A(A(t)) = — 1, Vt > 0. As shown 
in Chapter 4, the adaptive law guarantees that e, a, b G £oo: e > a, b G £oo fj £ 2 . 
From pi = = 7 and b~ x G £00 (because of projection), it follows that 

Pi,Po G £00 and P!,Pg G £ m |j£ 2 . Hence, ||A(t)| G £ 001^^2 which together with 
A(A(t)) = —1 Vt > 0 and Theorem 3.4.11 imply that the state transition matrix 
4>(f , r) associated with Aft) satisfies 

\\^(t,r)\\ < kie~ k2 ^~ T \ Vt > t > 0 


for some constants k \ , k- 2 > 0. 

Step 3. Use the properties of the £25 norm and B-G Lemma to establish 
boundedness. For simplicity, let us now denote || (•)* II 25 for some S > 0 with || • ||. 
Applying Lemma 3.3.3 to (7.4.15) we obtain 

||x|| < c||em 2 || + c, \x(t)\ < c\\em 2 \\ + c (7.4.16) 


for any S G [0, <5i) where <5i > 0 is any constant less than 2k2, and some finite 
constants c > 0. 

As in the MR AC case, we define the fictitious normalizing signal 


m 2 f = 1 + |Kf+ llppf 

From (7.4.15) we have ||u p || + ||j/ p || < c||a?|j + c||em 2 || + c, which, together with 
(7.4.16), implies that 

to 2 < c||e?n 2 || 2 + c 

Because |0i| < c||u p ||, \<j> 2 \ < c||j/ p || for S G [0, 2A), it follows that m = \/l + 0 T <j> < 
cm.f and, therefore, 

m 2 < c||pm/|| 2 + c 

where g = em G £2 because of the properties of the adaptive law, or 


mj < c 


0 -S(t~T)~ 2 


p 2 (r)mj(r)dr + c 


where 0 < 5 < S* and S* = min[2A, <Si], <5i G (0,2fc 2 ). Applying the B-G Lemma, 
we can establish that m/ G £00 • Because m < cm/, we have m and therefore 
01 > 02 , Up, Up G £ 00 • 

Step 4. Establish tracking error convergence. We consider the estimation error 
equation 

em 2 = — 02 — «02 - 60 1 
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2 , 1 f 1 

em = (s + a) — — y p - b — — u p 
s + A s + A 

Operating on each side of (7.4.17) with s = 4z, we obtain 


(7.4.17) 


/ 2\ ( „ 1 , S X 1 

slew ) = s(s + a) -y D — b -u v — b -u v 

v ’ v ' s + X p s + A p s + A p 


(7.4.18) 


by using the property s(xy) = xy + x y. For A = Ao, it follows from the control law 
(7.4.12) that 

— — r u p = -{pis + po) — -rei 
s+A s+A 

which we substitute in (7.4.18) to obtain 

s(em 2 ) = s(s + a) y p + 6(piS + p 0 ) 1 t ei - b u p (7.4.19) 

s+A s + A s + A 


Now, because s(s + a) j^y p = (s + a)j^y p + a-^Nyp and sei = sy p - sy m = sy p 
(note that sy m =0), we have 

, „ 1 , ,, s x 1 

s{s + a )— — y p = {s + a)——e i + a——y p 
s+A s+A s+A 

which we substitute in (7.4.19) to obtain 


s(em 2 ) = (s + a)s + b(pis + po) -ei + 

L J s + A 


1 x 1 j 1 

— ei + a — — y p - b — — -u p 


s+A s+A s+A 


Using (7.4.10) we have (s + a)s + b(p\s + po) — (s + l) 2 and therefore 

, 2\ (s + l) 2 i 1 j 1 

spm = — r-ei + a——y p - b——u p 

s+A s + A s+A 


s(s + A) o 5 + A^f s + A j 1 

e l = 7 _ 7 TT^a ry p + 7 ttxxO -U p 

(s + 1) 2 (s+1) 2 s + A p (s + 1) 2 s + A p 


(7.4.20) 


Because u p ,y p ,m, e £ C Q 0 and a, b, em £ C-ooC\P' 2 i it follows from Corollary 3.3.1 
that ei £ Coo f)£ 2 . Hence, if we show that e.\ £ Coo, then by Lemma 3.2.5 we can 
conclude that e\ — > 0 as t — > oo. Since e\ = y p = ay p + bu p £ Coo, it follows that 
e\ — y 0 as t — > oo. 

We can continue our analysis and establish that e, a,b,p 0 ,p 1 — > 0 as t — > oo. 
There is no guarantee, however, that po,Pi, a,b will converge to the actual values 
Po,Pi,a,b respectively unless the reference signal y rn is sufficiently rich of order 2, 
which is not the case for the example under consideration. 
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As we indicated earlier the calculation of the controller parameters Po(t),pi(t) 
at each time is possible provided the estimated plant polynomials (s + a(t)), b(t) are 
strongly coprime, i.e., provided \b(t)\ > bo > 0 Vt > 0. This condition implies that 
at each time t, the estimated plant is strongly controllable. This is not surprising 
because the control law is calculated at each time t to meet the control objective 
for the estimated plant. As we will show in Section 7.6, the adaptive law without 
projection cannot guarantee that \b(t)\ > bo > 0 Vt > 0. Projection requires 
the knowledge of bo and sgn(6) and constrains b(t) to be in the region \b(t)\ > 
bo where controllability is always satisfied. In the higher-order case, the problem 
of controllability or stabilizability of the estimated plant is more difficult as we 
demonstrate below for a general nth-order plant. V 


General Case 

Let us now consider the nth-order plant 

_ Z.p(s) 

Jr ~ R P (>) " 

where Z p (s),R p (s) satisfy assumptions PI, P2, and P3 with the same control 
objective as in Section 7.3.1, except that in this case the coefficients of Z p , R p 
are unknown. The APPC scheme that meets the control objective for the 
unknown plant is formed by combining the control law (7.3.12), summarized 
in Table 7.1, with an adaptive law based on the parametric model (7.4.2) 
or (7.4.3). The adaptive law generates on-line estimates 9 a ,9b of the coeffi- 
cient vectors, 9* of R p (s) = s n + 9* T a n -i(s) and 9 1 of Z p {s ) = #j) T a n _i(s) 
respectively, to form the estimated plant polynomials 

Rp{s,t ) = s n + 9ja n -i(s), Z p (s,t ) = 0ja n -i(s) 

The estimated plant polynomials are used to compute the estimated con- 
troller polynomials L(s,t), P(s,t) by solving the Diophantine equation 

LQm ■ Hp T P ■ Z p = A* (7.4.21) 

for L, P pointwise in time or the algebraic equation 


Sift = 


(7.4.22) 
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for 0i , where Si is the Sylvester matrix of R p Q m , Z p ; 0i contains the coeffi- 
cients of L, P; and a f contains the coefficients of A*(s) as shown in Table 7.1. 
The control law in the unknown parameter case is then formed as 

Up = (A LQ ni ) —Up P — (y p Vm) (7.4.23) 

Because different adaptive laws may be picked up from Tables 4.2 to 4.5, a 
wide class of APPC schemes may be developed. As an example, we present 
in Table 7.4 the main equations of an APPC scheme that is based on the 
gradient algorithm of Table 4.2. 

The implementation of the APPC scheme of Table 7.4 requires that 
the solution of the polynomial equation (7.4.21) for L,P or of the alge- 
braic equation (7.4.22) for 0i exists at each time. The existence of this 
solution is guaranteed provided that R p (s,t)Q m (s), Z p (s,t) are coprime at 
each time t, i.e., the Sylvester matrix Si(t) is nonsingular at each time t. 
In fact for the coefficient vectors l,p of the polynomials L,P to be uni- 
formly bounded for bounded plant parameter estimates 0 P , the polynomials 
Rp(s,t)Qm(s), Z p (s,t ) have to be strongly coprime which implies that their 
Sylvester matrix should satisfy 

| det(<S)(i))| > u 0 > 0 

for some constant at each time t. Such a strong condition cannot be 
guaranteed by the adaptive law without any additional modifications, giv- 
ing rise to the so called “ stabilizability’' or “ admissibility/' problem to be 
discussed in Section 7.6. As in the scalar case, the stabilizability problem 
arises from the fact that the control law is chosen to stabilize the estimated 
plant (characterized by Z p (s, t), R p (s, t)) at each time. For such a control law 
to exist, the estimated plant has to satisfy the usual observability, controlla- 
bility conditions which in this case translate into the equivalent condition of 
R p (s, t)Qm(s), Rp(s, t) being coprime. The stabilizability problem is one of 
the main drawbacks of indirect APPC schemes in general and it is discussed 
in Section 7.6. In the meantime let us assume that the estimated plant is 
stabilizable, i.e., R p Q m ,Z p are strongly coprime Vt > 0 and proceed with 
the analysis of the APPC scheme presented in Table 7.4. 

Theorem 7.4.1 Assume that the estimated plant polynomials R p Q m , Z p are 
strongly coprime at each time t. Then all the signals in the closed-loop 
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Table 7.4 APPC scheme: polynomial approach. 


Plant 

Z p (s) 

Up ~ R p (s) u p 

Z p {s ) = ^ T a n _i(s) 

Rp{s) = s n + <9* T a n _i(s) 

an-r(s) = [a" -1 , « n “ 2 , • • • , a, 1 } T ,0* P = [dt T ,9* a T ] T 

Reference 

signal 

Qm(s)y m = 0 

Adaptive law 

Gradient algorithm from Table 4.2 

e p = Tec/), r = r T > o 

e = (z — 6jcj))/m 2 , to 2 = 1 + (f 1 (j) 

“, T -iW it 

P — 1 A p (s) U P ’ A p (s) Up\ 

Z ~ A p(s)2/P’ — [^6 > ] 

Z p (s,t) = ^ T a„_i(s), R P (s,t) = s n + 0ja n -i(s) 

Calculation 

Solve for L(s,t ) = s n_1 + £ T a n _ 2 (s), 

P(s,t ) = p T a n+? _i(s) the polynomial equation: 
L(s,t)-Qm(s)-Rp(s,t ) + P(s,t)-Z p (s,t ) = A*(s) 
or solve for A the algebraic equation 

A A = Oil 

where A is the Sylverster matrix of R p Q m , Z p 

A = [lj , P T ] ' T e ^ 2(n+9) , l q = [0 1 _^ I 0, 1,Z t ] t € P n+<? 


A* (a) = s 2 "^- 1 + a* T a 2 n +(? - 2 %) 
af = [0, . . . , 0, 1, a* T ] T € P 2(n+g) 

<? 

Control law 

Up = (A — LQm)-^Up .P y(?/p — Vm) 

Design 

variables 

A*(s) rnonic Hurwitz; A(s) rnonic Hurwitz of degree 
n + q — 1; for simplicity, A(s) = A p (s)A q (s), where 
A p (s),Ag(s) are rnonic Hurwitz of degree n, q — 1, 
respectively 
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APPC scheme of Table 7-4 are u.b. and the tracking error converges to zero 
asymptotically with time. The same result holds if we replace the gradient 
algorithm in Table 7 .4 with any other adaptive law from Tables 4-% and 4-3. 

Outline of Proof: The proof is completed in the following four steps as in Example 
7.4.1: 

Step 1 . Manipulate the estimation error and control law equations to express 
the plant input u p and output y p in terms of the estimation error. This step leads 
to the following equations: 


X 

= A{t)x + biftfem 2 + b 2 yrn 


Up 

= Cj x + diem 2 + d 2 i/m 

(7.4.24) 

Vv 

= Cf x + d^em 2 + d\y m 



where y m € £oo; A(t),bi(t) are u.b. because of the boundedness of the estimated 
plant and controller parameters (which is guaranteed by the adaptive law and the 
stabiliz ability assumption); b -2 is a constant vector; C\ and C 2 are vectors whose 
elements are u.b.; and di to di are u.b. scalars. 

Step 2. Establish the e.s. of the homogeneous part of (7.4-24). The matrix Aft) 
has stable eigenvalues at each frozen time t that are equal to the roots of 7U(s) = 0 . 
In addition 9 p ,l,p € £2 (guaranteed by the adaptive law and the stabilizability 
assumption), imply that ||yi(t)|| € £ 2 - Therefore, using Theorem 3.4.11, we conclude 
that the homogeneous part of (7.4.24) is u.a.s. 

Step 3. Use the properties of the £25 norm and B-G Lemma to establish 
boundedness. Let m 2 = 1 + HupH 2 + \\y p \\ 2 where || • || denotes the £25 norm. Using 
the results established in Steps 1 and 2 and the normalizing properties of m/, we 
show that 

m 2 < c\\emm f\\ 2 + c (7.4.25) 

which implies that 

m 2 < c f e~ s ^ t ~ T ^e 2 m 2 m 2 dT + c 

Jo 

Because em € £ 2 , the boundedness of m/ follows by applying the B-G lemma. 
Using the boundedness of to/, we can establish the boundedness of all signals in 
the closed-loop plant. 

Step 4. Establish that the tracking error ei converges to zero. The convergence 
of ei to zero follows by using the control and estimation error equations to express 
ei as the output of proper stable LTI systems whose inputs are in £2 D £00 • 

The details of the proof of Theorem 7.4.1 are given in Section 7.7. □ 
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7.4.3 APPC Schemes: State- Variable Approach 

As in Section 7.4.2, let us start with a scalar example to illustrate the design 
and analysis of an APPC scheme formed by combining the control law of 
Section 7.3.3 developed for the case of known plant parameters with an 
adaptive law. 

Example 7.4.2 We consider the same plant as in Example 7.3.2, i.e. , 

y P = — 7 —Up (7.4.26) 

where a and b are unknown constants with 6^0 and u p is to be chosen so that the 
poles of the closed-loop plant are placed at the roots of A*(s) = (s + l ) 2 = 0 and 
y p tracks the reference signal y m = 1. As we have shown in Example 7.3.2, if a, b 
are known, the following control law can be used to meet the control objective: 

q U q e+ | bu p -K 0 {[ 10]e-ei) 

5 + 1 

K c e, Up = Up (7.4.27) 

where K 0 , K c are calculated by solving the equations 

det (si — A + BK C ) = (s + l ) 2 
det (si — A + K 0 C t ) = (s + 5 ) 2 

where 

A= ~ a l , B= | 6 , C T = [1,0] 

i.e., 

K c =\[ l-o,l], K a = [ 10 -a, 25] T (7.4.28) 

The APPC scheme for the plant (7.4.26) with unknown a and b may be formed 
by replacing the unknown parameters a and b in (7.4.27) and (7.4.28) with their 
on-line estimates a and b generated by an adaptive law as follows: 

Adaptive Law The adaptive law uses the measurements of the plant input u p and 
output y p to generate a, b. It is therefore independent of the choice of the control 
law and the same adaptive law as the one used in Example 7.4.1 can be employed, 
i.e., 

e p = Pr{r e (/>}, r = r T > 0 
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e = 


0 P = 


Z — O' 6 „ -r- 

p m 2 = l + d) T c 


z = 


m 


.s A 


y P 


(7.4.29) 


' b " 

1 

Up 

a 

’ 0 s + A 

-y P _ 


where A > 0 is a design constant and Pr {•} is the projection operator as defined in 
Example 7.4.1 that guarantees b(t)\ > 6 0 > 0 Mt > 0. 


State Observer 


where 


e = A(i)e + B(t)u p — K 0 (C T e — ei) 


(7.4.30) 


A = 


—a 1 

0 0 


, B = 


b, C T = [1,0] 


Calculation of Controller Parameters Calculate K c . K 0 by solving 

det (si — A + BK C ) = A*(s) = (s + l) 2 , det {si — A + K a C T ) = (s + 5) 2 


for each frozen time t which gives 

K c =l[l-a,l], K 0 = [ 10 -a, 25] T 
b 


(7.4.31) 


Control Law: 


= ~K r 


s + 1 . 


(7.4.32) 


The solution for the controller parameter vector K c exists for any monic Hurwitz 
polynomial ^4J(s) of degree 2 provided (A, B) is stabilizable and A*(s) contains the 
uncontrollable eigenvalues of A as roots. For the example considered, (A, B) loses 
its controllability when 6=0. It also loses its controllability when 6 yf 0 and a = 1. 
In this last case, however, (A, B), even though uncontrollable, is stabilizable and 
the uncontrollable eigenvalue is at s = — 1 , which is a zero of A*(s) = (s + l) 2 . 
Therefore, as it is also clear from (7.4.31), K c exists for all a, b provided 6^0. 
Because the projection operator in (7.4.29) guarantees as in Example 7.4.1 that 
|6(t)| > 6 q > 0 Vi > 0, the existence and boundedness of I\ c follows from a, b € £oo- 


Analysis 


Step 1. Develop the state error equations for the closed-loop APPC scheme. 
The state error equations for the closed-loop APPC scheme include the tracking 
error equation and the observer equation. The tracking error equation 


ei 


b(s + 1) _ _ s 

■ rUp, Up = ■ rU p 

S(S + a) S+l 
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is expressed in the state space form 

0 “ J e+ J bu p (7.4.33) 

ei = C T e = [1, 0]e 

Let e Q = e — e be the observation error. Then from (7.4.30) and (7.4.33) we obtain 
the state equations 

e = A c (t)e + C T e a 

-io 1 1 , r 1 1 - r 1 1 1- 

2g q e G T q aei ^ bup 

u p = -K c e (7.4.34) 

where A c (t) = A(t) — BK C , a = a — a,b = b — b. The plant output is related to eo, e 
as follows: 

Up = ei + y m = C T (e 0 + e) + y m (7.4.35) 

The relationship between u p and eo,e may be developed as follows: 

The coprimeness of 6, s(s + a) implies the existence of polynomials X(s), Y (s) 
of degree 1 and with X(s) monic such that 

s(s + a)X(s) + b(s + l)F(s) = A*(s) (7.4.36) 




where A* (s) = (s+l)a*(s) and a*(s) is any monic polynomial of degree 2. Choosing 
a*(s ) = (s + l) 2 , we obtain X(s) = s + 1 and F(s) = ( 2 ~°) a+1 - Equation (7.4.36) 
may be written as 

^ = 1 (7-4.37) 

(s + 1) 2 (s + 1) 2 


which implies that 


s(s + a) (2 — a)s + 1 
(s + l) 2 ^ + (s + 1) 2 ' 


Using Up = su p = (s + 1 )u p and u p = —K c e, we have 


Up — 


s + a ~ „ (2 — a)s + 1 (s + a) 

J^l Kc£ + (s + 1) 2 V~ 


Up 


(7.4.38) 


Equations (7.4.34), (7.4.35), and (7.4.38) describe the stability properties of the 
closed- loop APPC scheme. 

Step 2. Establish the e.s. of the homogeneous part of (7.4-34)- The homoge- 
neous part of (7.4.34) is considered to be the part with the input aei, bu p set equal 
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to zero. The e.s. of the homogeneous part of (7.4.34) can be established by showing 

that A c (t) is a u.a.s matrix and A 0 = 25 0 * s a stable matrix. Because A 0 

is stable by design, it remains to show that A c (t) = Aft) — B(t)K c [t) is u.a.s. 

The projection used in the adaptive law guarantees as in Example 7.4.1 that 
\b(t)\ > b o > 0,Vt > 0. Because b,a G Coo it follows that the elements of K C ,A C are 
u.b. Furthermore, at each time t, X(A c (t)) = -1,-1, i.e., A c [t ) is a stable matrix 

at each frozen time t. From (7.4.31) and a,b G £ 2 , we have \K C \ G £ 2 . Hence, 
||A c (f)|| G £ 2 and the u.a.s of A c (t) follows by applying Theorem 3.4.11. 

In the rest of the proof, we exploit the u.a.s of the homogeneous part of (7.4.34) 
and the relationships of the inputs aei, bu p with the properties of the adaptive 
law in an effort to first establish signal boundedness and then convergence of the 
tracking error to zero. In the analysis we employ the £ 2 <5 norm IKOtlk which for 
clarity of presentation we denote by || • ||. 

Step 3. Use the properties of the C 2 $ norm and B-G Lemma to establish 
boundedness. Applying Lemmas 3.3.2 and 3.3.3 to (7.4.34), (7.4.35), and (7.4.38), 
we obtain 

||e|| < c||C T e 0 || 

IM < c||C T e 0 || + c||e||+c 
IKII < c||e|| + c||y p ||+c 

for some S > 0 where c > 0 denotes any finite constant, which imply that 

\\y P \\ < c\\C T e 0 \\+c 

IKII < c||C' T e 0 ||+c 


Therefore, the fictitious normalizing signal 

m} = 1 + IKH 2 + IKf ^ c||C' T e 0 || 2 + c (7.4.39) 

We now need to find an upper bound for ||C' T e 0 ||, which is a function of the £ 2 
signals em.,a,b- From equation (7.4.34) we write 


C T e a 


(s + 5) 2 “ ei (s + 5) 2 ^s + l Up 


Applying the Swapping Lemma A.l (see Appendix A) to the above equation and 
using the fact that sei = sy p we obtain 


C T e 0 = + ~ + 5) 2 Mp + W c i(Wbi e i)a - W c2 (W b2 u p )b (7.4.40) 
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where the elements of W C i(s), W b i(s) are strictly proper transfer functions with poles 
at —1, —5. To relate the first two terms on the right-hand side of (7.4.40) with em 2 , 
we use (7.4.29) and z = 0* T <j> to write 

cm 2 = z — 9 p <j) = <j> = a— j— y p - b—^—u p (7.4.41) 

S ~r A St A 

We filter both sides of (7.4.41) with (^*2 and then apply the Swapping Lemma A.l 
to obtain 

^T5^ em2 = a ( s + 5) 2 ^ “ ~ b ( s + 5)2 + Wc {( Wb ypy a ~ ( W b up)b } (7.4.42) 

where the elements of W c (s ), W b (s) are strictly proper transfer functions with poles 
at —5. Using (7.4.42) in (7.4.40), we have that 

C T e a = + ^ em 2 + G(s, a, b) (7.4.43) 

(s + 5)- 


where 


G(s, a, S) = W cl (W bl e{)'a - W c2 (W b2 u p )b - W c {{W b y p ) 'a - (W b u p )i} 

Using Lemma 3.3.2 we can establish that <j>/m,f , m/mf, W b ie\/m,f, W b2 u p /mf, 
Wby p /m f , WbUp/rri f € Coo- Using the same lemma we have from (7.4.43) that 

|| C T e 0 1| < c||emm/|| + c||m/o|| + c||m/S|| (7.4.44) 

Combining (7.4.39), (7.4.44), we have 

m 2 < c||gm/|| 2 + c (7.4.45) 

or 

ntf < c f e~ s( - t ~ T ^g 2 (T)m 2 (T)d,T + c 

Jo 

. 2 A .A 

where g 2 (r) = e 2 m 2 + a + b 2 . Because em,a,b £ C 2l it follows that g € C 2 
and therefore by applying B-G Lemma to (7.4.45), we obtain to/ £ Coo- The 
boundedness of to/ implies that (f>,m,Wbiei,Wb 2 u p ,Wby p ,W b u p £ Coo . Because 

a, b, em 2 £ Coo{}C 2 , it follows that C T e 0 £ CooC\C 2 by applying Corollary 3.3.1 
to (7.4.43). Now by using C T e 0 £ Coo f) C 2 in (7.4.34), it follows from the stability 
of A c (t) and Lemma 3.3.3 or Corollary 3.3.1 that e € Coo D L 2 and e(t) — > 0 as 
t — » oo. Hence, from (7.4.35), we have y p € Coo and e\ £ Coo f) C 2 and from 
(7.4.38) that u p £ Coo. 
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Step 4. Convergence of the tracking error to zero. Because e\ £ Coo (due 
to ei = y p and y p £ Coo) and e\ £ C 00 f^jC 2 , it follows from Lemma 3.2.5 that 
ei (t) — > 0 as t — » oo. 

We can continue the analysis and establish that e, em — > 0 as t — > oo, which 
implies that 9 P , K c — > 0 as t — » oo. 

The convergence of 9 P to 9* = [6, a] T and of K c to K c = £[1 — a, 1] cannot be 
guaranteed, however, unless the signal vector 4> is PE. For <j> to be PE, the reference 
signal y rn has to be sufficiently rich of order 2. Because y m = 1 is sufficiently rich 
of order !,(/)£ 1Z 2 is not PE. V 


General Case 

Let us now extend the results of Example 7.4.2 to the nth-order plant (7.3.1). 
We design an APPC scheme for the plant (7.3.1) by combining the state 
feedback control law of Section 7.3.3 summarized in Table 7.2 with any ap- 
propriate adaptive law from Tables 4.2 to 4.5 based on the plant parametric 
model (7.4.2) or (7.4.3). 

The adaptive law generates the on-line estimates R p (s,t), Z p (s,t) of the 
unknown plant polynomials R p (s),Z p (s), respectively. These estimates are 
used to generate the estimates A and B of the unknown matrices A and B, 
respectively, that are used to calculate the controller parameters and form 
the observer equation. 

Without loss of generality, we concentrate on parametric model (7.4.2) 
and select the gradient algorithm given in Table 4.2. The APPC scheme 
formed is summarized in Table 7.5. 

The algebraic equation for calculating the controller parameter vector 
K c in Table 7.5 has a finite solution for K c (t ) at each time t provided the 
pair (A, B ) is controllable which is equivalent to Z p (s, t)Qi(s), R p (s, t)Q m (s) 
being coprime at each time t. In fact, for K c (t) to be uniformly bounded, 
( A,B ) has to be strongly controllable, i.e., the absolute value of the de- 
terminant of the Sylvester matrix of Z p (s, t)Qi(s), R p (s,t)Q m (s) has to be 
greater than some constant uq > 0 for all t > 0. This strong controllability 
condition may be relaxed by choosing Qi(s) to be a factor of the desired 
closed- loop Hurwitz polynomial A*(s) as indicated in Table 7.5. By doing 
so, we allow Qi(s), R p (s,t) to have common factors without affecting the 
solvability of the algebraic equation for K c , because such common factors are 
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Table 7.5 APPC scheme: state feedback law 


Plant 

y P - R P p { s ] u p 

Z p {s ) = ^ T a n _i(s), Rp(s ) = s n + 9* T a n -i(s) 

a n -i(s)= [s n - 1 ,s n - 2 ,...s,l] T 

Reference 

signal 

Qm(s^ym = 0 

Adaptive law 

Gradi 

Op = : 
z- 

6 = - 

cj) = 

Op = 

Zp (s,t 

ient algorithm based onz = 0* T <j) 

r e(j>, r = r T > o 

m 2 = l + (f) T (j^ 

A p( s) «p> a p (s) > 2 = A p(s) yp 

)=0j (t)a n -i (s), Rp(s,t) = s n +0j(t)a n - 1 (s) 

State observer 

e = Ae + Bu p — K Q (t)(C T e — ei), e £ 7£ n+<7 

In+q—l 

A= -O^t) ,B=0 2 (t),C T =[ 1,0,..., 0] 

0 

6iCR n+q is the coefficient vector of R p Q m — s n+q 

0 2 € 7£ n+,? is the coefficient vector of Z p Q\ 

K 0 = a* — 0 1 , and a* is the coefficient vector of 

A* 0 {s) - s n+q 

Calculation of 

controller 

parameters 

Solve for K c pointwise in time the equation 
det (sI-A + BKc) = A* c (s ) 

Control law 

Up = K c (t)e, Up = Up 

Design 

variables 

Q m {s) rnonic of degree g with nonrepeated roots on 
the juj- axis; Qi(s) rnonic Hurwitz of degree q; A$s) 
rnonic Hurwitz of degree n+q ; A*(s) rnonic Hurwitz of 
degree n + q and with Qi(s) as a factor; A p (s) rnonic 
Hurwitz of degree n 
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also included in A*(s). Therefore the condition that guarantees the existence 
and uniform boundedness of K c is that Z p (s,t), Rp(s,t)Q m (s) are strongly 
coprime at each time t. As we mentioned earlier, such a condition cannot 
be guaranteed by any one of the adaptive laws developed in Chapter 4 with- 
out additional modifications, thus giving rise to the so-called stabilizability 
or admissibility problem to be discussed in Section 7.6. In this section, we 
assume that the polynomials Z p (s,t), R p (s,t)Q m (s) are strongly coprime at 
each time t and proceed with the analysis of the APPC scheme of Table 7.5. 
We relax this assumption in Section 7.6 where we modify the APPC schemes 
to handle the possible loss of stabilizability of the estimated plant. 

Theorem 7.4.2 Assume that the polynomials Z p , R p Q m are strongly co- 
prime at each time t. Then all the signals in the closed-loop APPC scheme 
of Table 7.5 are uniformly bounded and the tracking error e\ converges to 
zero asymptotically with time. The same result holds if we replace the gra- 
dient algorithm with any other adaptive law from Tables f.2 to 4-4 that is 
based on the plant parametric model (7.4-2) or (7.4-3). 

Outline of Proof 

Step 1. Develop the state error equations for the closed-loop APPC scheme, 

i.e., 

e 

e Q 

Vp 
u p 

Up 

where e Q = e — e is the observation error, A a is a constant stable matrix, W±(s) and 
IP 2 (s) are strictly proper transfer functions with stable poles, and A c (t) = A — BK C . 


= A c (t)e + K 0 C T e 0 

= A 0 e 0 + d\ei - 9-2 U p (7.4.46) 

= C T e 0 + C T e + y m 
= W 1 (s)k c (t)e+W 2 (s)y p 
= ~K c e 


Step 2. Establish e.s. for the homogeneous part of (7.4-46)- The gain K c is 
chosen so that the eigenvalues of A c (t) at each time t. are equal to the roots of the 
Hurwitz polynomial A*(s). Because A, B G Coo (guaranteed by the adaptive law) 
and Z p ,RpQ m are strongly coprime (by assumption), we conclude that (A, B) is 
stabilizable in a strong sense and K C ,A C G Coo- Using 0 a ,0b € C 2 , guaranteed by 
the adaptive law, we have K C ,A C G C 2 - Therefore, applying Theorem 3.4.11, we 
have that A c (t) is a u.a.s. matrix. Because A a is a constant stable matrix, the e.s. 
of the homogeneous part of (7.4.46) follows. 
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Step 3. Use the properties of the C 25 norm and the B-G Lemma to establish 
signal boundedness. We use the properties of the C 25 norm and equation (7.4.46) 
to establish the inequality 

mj < c\\gm f \\ 2 +c 

where g 2 = e 2 m 2 + \9 a \ 2 + \ 0 b \ 2 and to 2 = 1 + ||w p || 2 + ||2/ P || 2 is the fictitious 
normalizing signal. Because g£ £ 2 , it follows that m/€ Coo by applying the B-G 
Lemma. Using m.f £ C^, we establish the boundedness of all signals in the closed- 
loop plant. 

Step 4. Establish the convergence of the tracking error ei to zero. This is done 
by following the same procedure as in Example 7.4.2. □ 


The details of the proof of Theorem 7.4.2 are given in Section 7.7. 

7.4.4 Adaptive Linear Quadratic Control (ALQC) 

The linear quadratic (LQ) controller developed in Section 7.3.4 can be made 
adaptive and used to meet the control objective when the plant parameters 
are unknown. This is achieved by combining the LQ control law (7.3.28) to 
(7.3.32) with an adaptive law based on the plant parametric model (7.4.2) 
or (7.4.3). 

We demonstrate the design and analysis of ALQ controllers using the 
following examples: 

Example 7.4.3 We consider the same plant and control objective as in Example 
7.3.3, given by 

x = — ax + bu p 

y P = x (7.4.47) 

where the plant input u p is to be chosen to stabilize the plant and regulate y p to 
zero. In contrast to Example 7.3.3, the parameters a and b are unknown constants. 

The control law u p = — \ bpy p in Example 7.3.3 is modified by replacing the 
unknown plant parameters a, b with their on-line estimates a and b generated by 
the same adaptive law used in Example 7.4.2, as follows: 

Adaptive Law 

6 P 


= Te<j), r = r T > 0 
z-ejf) 

= f — , m =1 + 1 


z = 


s + A( 


■2/p 
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' b " 

1 

Up 

a 

’ 0 s + A 0 

-y P _ 


where Aq > 0 is a design constant. 


Control Law 


Up — 


1 

A 


b(t)p(t)y p 


Riccati Equation Solve the equation 


—2 a(t)p(t) 


p 2 (t)b 2 (t) 

A 


+ 1=0 


at each time t for p(t) > 0, i.e., 


(7.4.48) 


P(t) 


—A a + \J A 2 a 2 + b 2 A 

P 


>0 


(7.4.49) 


As in the previous examples, for the solution p(t) in (7.4.49) to be finite, the estimate 
b should not cross zero. In fact, for p(t) to be uniformly bounded, b(t) should 
satisfy \b(t)\ > bo > 0,V< > 0 for some constant bo that satisfies |6| > b 0 . Using 
the knowledge of bo and sgn(6), the adaptive law for b can be modified as before 
to guarantee |6(t) | > b 0 , Vt > 0 and at the same time retain the properties that 
0 P € £oo and e,em : 0 p £ £2 H Coq. The condition \b(t) > bo implies that the 
estimated plant, characterized by the parameters a, 6, is strongly controllable at 
each time t, a condition required for the solution p[t) > 0 of the Riccati equation 
to exist and be uniformly bounded. 

Analysis For this first-order regulation problem, the analysis is relatively simple 
and can be accomplished in the following four steps: 


Step 1. Develop the closed-loop error equation. The closed-loop plant can be 
written as 


x = —(a + 



+ ax — bii. p 


(7.4.50) 


by adding and subtracting ax — bu p and using u p = —bpx/X. The inputs ax, bu p are 
due to the parameter errors a = a — a,b = b — b. 


Step 2. Establish the e.s. of the homogeneous part of (7. f. 50). The eigenvalue 
of the homogeneous part of (7.4.50) is 
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which is guaranteed to be negative by the choice of p(t) given by (7.4.49), i.e. , 


— (a + 


bAp 
A ’ 



A \Tx 


< 0 


provided, of course, the adaptive law is modified by using projection to guarantee 
\b(t)\ > bo,\/t > 0. Hence, the homogeneous part of (7.4.50) is e.s. 

Step 3. Use the properties of the C 2 s norm and B-G Lemma to establish bound- 
edness. The properties of the input ax — bu p in (7.4.50) depend on the properties 
of the adaptive law that generates a and b. The first task in this step is to establish 
the smallness of the input ax — bu p by relating it with the signals a, 6, and em that 
are guaranteed by the adaptive law to be in C 2 . 

We start with the estimation error equation 

em 2 = z — 6l<l> = — dj cj> = a — 1— - x — b — 1— - u v (7.4.51) 

s H- Aq s H- Aq 


Operating with (s + Ao) on both sides of (7.4.51) and using the property of differ- 

entiation, i.e., sxy = xy + xy where s = 4 l is treated as the differential operator, 
we obtain 

(s + Ao )em 2 = ax — bu p + a —x — b — u p (7.4.52) 

S + Aq s + Aq 


Therefore, 


ax — bu p = (s + Ao )emr 

which we substitute in (7.4.50) to obtain 

b 2 p 2 

x = —(a H — — )x + (s + Ao )sm 
A 


1 f 1 


— a —x 


S + Aq S + Aq 


^ 1 f 1 

— a —x + b — Up 

S + Aq S + Aq 


(7.4.53) 


If we define e = x — em? , (7.4.53) becomes 

^ C- k 2 7V_L_ (\ - h2p \ 2 

e = — (aH — — )e + (Ao — a —)em 

A A 

x = e + em 2 


— a 


S + A 0 


S + An 


(7.4.54) 


Equation (7.4.54) has a homogeneous part that is e.s. and an input that is small in 
some sense because of em, a,b € £ 2 - 

Let us now use the properties of the C 2 s norm, which for simplicity is denoted 
by || • || to analyze (7.4.54). The fictitious normalizing signal m.f satisfies 


mj = l+ \\y p \\ 2 + ||u p || 2 < 1 + c||x|| 2 


(7.4.55) 
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for some 6 > 0 because of the control law chosen and the fact that b,p G C^. 
Because x = e + em 2 , we have ||a;|| < ||e|| + ||£m 2 ||, which we use in (7.4.55) to 
obtain 

m 2 < 1 + c\\e \\ 2 + c\\em 2 \\ 2 (7.4.56) 

From (7.4.54), we have 

||e|| 2 < c||e?n 2 || 2 + c||ax|| 2 + c||6m p || 2 (7.4.57) 

where x = s + Xo x ’ u P = s + Xq u p . Using the properties of the C 25 norm, it can be 
shown that irif bounds from above m,x,u p and therefore it follows from (7.4.56), 
(7.4.57) that 

m 2 < 1 + c||emm/|| 2 + c||am/|| 2 + c||6m/|| 2 (7.4.58) 

which implies that 

m 2 < 1 + c f e _ ' 5 ^ _T ^ 2 (r)m 2 (r)dT 
Jo 

A • 2 ~ 

where g 2 = E 2 mn 2 + a + b . Since em, a,b € £2 imply that g £ £ 2 , the boundedness 
of 777 . f follows by applying the B-G Lemma. 

Now rrif £ implies that m, x, u p and, therefore, <j) € £- 00 ■ Using (7.4.54), 

and the fact that a, 6, em 2 , x = (l/(s + Ao))x, u p = (l/(s + Ao))w p £ £ 00 , we 
have e £ £<*,, which implies that x = y p £ £ 00 , and therefore u p and all signals in 
the closed loop plant are bounded. 

Step 4. Establish that x = y p 0 as t — > 00 . We proceed as follows: Using 
(7.4.54), we establish that e £ £ 2 , which together with em 2 £ £2 imply that x = 
e + em 2 £ £2- Because (7.4.53) implies that x £ £oo, it follows from Lemma 3.2.5 
that x(t) — > 0 as t — » 00 . 

The analysis of the ALQ controller presented above is simplified by the fact that 
the full state is available for measurement; therefore, no state observer is necessary. 
Furthermore, the u.a.s. of the homogeneous part of (7.4.50) is established by simply 
showing that the time-varying scalar a + b 2 p / A > bo/y/X >0, V< > 0, i.e. , that 
the closed-loop eigenvalue is stable at each time t, which in the scalar case implies 
u.a.s. V 

In the following example, we consider the tracking problem for the same 
scalar plant given by (7.4.47). In this case the analysis requires some addi- 
tional arguments due to the use of a state observer and higher dimensionality. 

Example 7.4.4 Let us consider the same plant as in Example 7.4.3 but with the 
following control objective: Choose u p to stabilize the plant and force y p to follow 
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the constant reference signal y m (t) = 1- This is the same control problem we solved 
in Example 7.3.4 under the assumption that the plant parameters a and b are known 
exactly. The control law when a, b are known given in Example 7.3.4 is summarized 
below: 


State Observer 


= Ae + Bu p - I\ 0 {C T e- e i), e\ = y v -y n 


Control Law 


Riccati Equation 


S ~ I - 1 

Up = —A” 1 [6, b]Pe, Up = Up 

A T P + PA — PBB t PX~ 1 + CC T = 0 


where A = 


—a 1 
0 0 ’ 


6, C T = [1,0], K f) = [10 - a, 25]' 


In this example, we assume that a and b are unknown constants and use the 
certainty equivalence approach to replace the unknown a , b with their estimates a , b 
generated by an adaptive law as follows: 


State Observer 

e = A(t)e + Blip — K 0 (t ) ([1 0]e — e{) 

A= " a l ,B = b l ,K 0 = 10 2 " 6 (7.4.59) 


Control Law 

u,p = — -r-[l 1]-Pe, Up = — - — Up (7.4.60) 

A s 

Riccati Equation Solve for P(t) = P T (t) > 0 at each time t the equation 

r> dT 

A T P + PA- P^—P + CC T = 0, C T = [1, 0] (7.4.61) 

A 

The estimates a(t) and b(t) are generated by the same adaptive law as in Example 

7.4.3. 

For the solution P = P T > 0 of (7.4.61) to exist, the pair ( A , B) has to be 
stabilizable. Because (A, B , C) is the realization of , the stabilizability of 

{A, B ) is guaranteed provided b ^ 0 (note that for a = 1 , the pair ( A , B) is 
no longer controllable but it is still stabilizable). In fact for P(t) to be uniformly 
bounded, we require |fo(t)| > b 0 > 0, for some constant bo, which is a lower bound for 
\b\. As in the previous examples, the adaptive law for b can be modified to guarantee 
IHf)| ^ Vf > 0 by assuming that &o and sgn(6) are known a priori. 

Analysis The analysis is very similar to that given in Example 7.4.2. The tracking 
error equation is given by 

e= o ° o 6+ 1 hUp ' ei = [ 1 ’°] e 
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If we define e a = e — e to be the observation error and use the control law (7.4.60) 
in (7.4.59) , we obtain the same error equation as in Example 7.4.2, i.e., 


e = A c (t)e + 


-a + 10 
25 


C T e 


' -10 

1 ' 


' 1 ' 


■ 1 ' 

-25 

0 

e 0 + 

0 

ae 1 — 

1 


where A c (t) = A(t) — BB T P / A and y p , u p are related to e 0 , e through the equations 

Up — C T e o + C T e + y m 


s + a B t P „ (2 - a)s + 1 s + o 

Up= ~s + 1 A 6+ (s + 1) 2 b~ Vp 

If we establish the u.a.s. of A c (t), then the rest of the analysis is exactly the 
same as that for Example 7.4.2. 

Using the results of Section 7.3.4, we can establish that the matrix A c (t) at each 
frozen time t has all its eigenvalues in the open left half s-plane. Furthermore, 

ikwii < iiiwii + 2||i ^ m + 

A A 

where ||A(t)||, ||B(t)|| £ £2 due to a,b £ £ 2 guaranteed by the adaptive law. By 
taking the first-order derivative on each side of (7.4.61), P can be shown to satisfy 

PA c + AjP=-Q (7.4.62) 


where 

Q(t) = A P + PA — P^—P - P^—P 

A A 

For a given A, B , and P, (7.4.62) is a Lyapunov equation and its solution P 
exists and is continuous with respect to Q, i.e., ||P(£)|| < c||Q(£)|| for some constant 
c > 0. Because of ||A(f)||, ||P(£)|| £ £ 2 and A,B,P £ £00, we have ||P(f)|| £ £ 2 
and, thus. A c (t ) £ £ 2 . Because A c (t) is a stable matrix at each frozen time t, we 
can apply Theorem 3.4.11 to conclude that A c is u.a.s. The rest of the analysis 
follows by using exactly the same steps as in the analysis of Example 7.4.2 and is, 
therefore, omitted. V 
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General Case 

Following the same procedure as in Examples 7.4.3 and 7.4.4, we can design 
a wide class of ALQ control schemes for the nth-order plant (7.3.1) by com- 
bining the LQ control law of Section 7.3.4 with adaptive laws based on the 
parametric model (7.4.2) or (7.4.3) from Tables 4.2 to 4.5. 

Table 7.6 gives such an ALQ scheme based on a gradient algorithm for 
the nth-order plant (7.3.1). 

As with the previous APPC schemes, the ALQ scheme depends on the 
solvability of the algebraic Riccati equation. The Riccati equation is solved 
for each time t by using the on-line estimates A, B of the plant parameters. 
For the solution P(t) = P T {t) > 0 to exist, the pair (A, B) has to be 
stabilizable at each time t. This implies that the polynomials R p (s,t)Q m (s ) 
and Z p (s : t)Q\(s) should not have any common unstable zeros at each frozen 
time t. Because Qi(s) is Hurwitz, a sufficient condition for ( A,B ) to be 
stabilizable is that the polynomials Rp(s,t)Q m (s), Z p (s,t) are coprime at 
each time t. For P(t) to be uniformly bounded, however, we will require 
R p (s 1 t)Qm(s), Z p (s,t) to be strongly coprime at each time t. 

In contrast to the simple examples considered, the modification of the 
adaptive law to guarantee the strong coprimeness of R p Q m , Z p without 
the use of additional a priori information about the unknown plant is not 
clear. This problem known as the stabilizability problem in indirect APPC 
is addressed in Section 7.6 . In the meantime, let us assume that the stabiliz- 
ability of the estimated plant is guaranteed and proceed with the following 
theorem that states the stability properties of the ALQ control scheme given 
in Table 7.6. 

Theorem 7.4.3 Assume that the polynomials R p (s,t)Q m (s), Z p (s,t) are 
strongly coprime at each time t. Then the ALQ control scheme of Table 
7.6 guarantees that all signals in the closed-loop plant are bounded and the 
tracking error e\ converges to zero as t. — > oo. The same result holds if we 
replace the gradient algorithm in Table 7.6 with any other adaptive law from 
Tables f.2 to 4-4 based on the plant parametric model (7.4-2) or (7.4-3). 

Proof The proof is almost identical to that of Theorem 7.4.2, except for some 
minor details. The same error equations as in the proof of Theorem 7.4.2 that 
relate e and the observation error e D = e — e with the plant input and output also 
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Table 7.6 Adaptive linear quadratic control scheme 


Plant 

Z p (s) 

yp ~ R p (s) u p 

Z p (s ) = 

Rp(s) =s n + ei T a n ^(s) 

On-l(s) = [« n_1 , S n ~ 2 , . . . , S, 1] T 

Reference 

signal 

Qm{s)y m = 0 

Adaptive law 

Same gradient algorithm as in Table 7.5 to generate 
Z p {s,t ) = Oj (t)a n -i(s), Rp(s, t) = s n + 6j (t)a n _i(s) 

State observer 

e = A(t)e + Bup — K D (t)(C T e — ei) 

In+q—l 

A(t)= -di , B(t) = e 2 (t) 

0 j 

ko(t) = a* - 01, C = [1, 0, . . . , 0] T € n n+q 

6\ is the coefficient vector of R p (s,t)Q m (s)—s n+q 

02 is the coefficient vector of Z p (s,t)Qi(s) 
a* is the coefficient vector of A*(s) — s n+q 

01 , 02 , a* 6 IZ n+q 

Riccati 

equation 

Solve for P(t) = P T (t) > 0 the equation 

A T P + PA - ±PBB t P + CC T = 0 

Control law 

V — 1 R T Pp V — y 

Up— x n re, a P ~ Q m { s ) a P 

Design 

variables 

Qm(s) is a rnonic polynomial of degree q with nonre- 
peated roots on the ju axis; A*(s) is a rnonic Hurwitz 
polynomial of degree n + q with relatively fast zero; 

A > 0 as in Table 7.3; Qi(s) is a rnonic Hurwitz poly- 
nomial of degree q. 
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hold here, i.e., 

e = A c (t)e + K 0 C T e 0 
e Q = A 0 e 0 + die± + d 2 K c e 
Up — C T e Q + C' T e + y m 
u p = Wi(s)K c (t)e + W 2 (s)y p 

The only difference is that in A c = A — BK C , we have K c = B T (t)P(t)/ A. If we 
establish that K c £ Coo, and A c is u.a.s., then the rest of the proof is identical to 
that of Theorem 7.4.2. 

The strong coprimeness assumption about R p Q m , Z p guarantees that the so- 
lution P(t) = P T (t) > 0 of the Riccati equation exists at each time t and P £ Coo- 
This, together with the boundedness of the plant parameter estimates, guarantee 
that B, and therefore K c £ Coo ■ Furthermore, using the results of Section 7.3.4, 
we can establish that A c (t) is a stable matrix at each frozen time t. As in Example 
7.4.4, we have 

iiA c( t)ii < iii W ii + 2 n^ii a m + \mmm 

X A 

and 

PA C + AjP = -Q 

where 

^ - a t 

i T i pbb t p pbb p 

Q = A P + PA 

A A 

which, as shown earlier, imply that ||-P(t)|| and, thus, ||A c (t)|| £ C 2 . 

The pointwise stability of A c together with ||A c (t)|| £ C 2 imply, by Theo- 
rem 3.4.11, that A c is a u.a.s. matrix. The rest of the proof is completed by 
following exactly the same steps as in the proof of Theorem 7.4.2. □ 


7.5 Hybrid APPC Schemes 

The stability properties of the APPC schemes presented in Section 7.4 are 
based on the assumption that the algebraic equations used to calculate the 
controller parameters are solved continuously and instantaneously. In prac- 
tice, even with high speed computers and advanced software tools, a short 
time interval is always required to complete the calculations at a given time t. 
The robustness and stability properties of the APPC schemes of Section 7.4 
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Table 7.7 Hybrid adaptive law 


Plant 

Z p ( s ) 

yp ~ R p (s) u p 

Z p (s ) = dl T a n -i(s),R p (s) = s n + 9*J a n -i (s) 
On-l(«) = [s n_1 , s n ~ 2 , . . . , s, 1] T 

Adaptive law 

6 p (k+ 1 ) = 0pk + T // fc fc+1 e(r)^(r)dr, k = 0 , 1, . . . 

^ - [ A p {s) U P ’ A p (s) yp\ ’ A p ( S ) y P 

e = m 2 fc , m 2 = 1 + (j) T (l), \/t G [t k ,t k+ 1) 

®pk = yJkV 

Rp(s, t k ) =S T @a(k—l)C >ln ~ l(®) 

Z p (s,t k ) = 

Design variables 

T s = t k+ i -t k > T m ; 2 - T s Amax(r) > 1, for some 

7 > 0; A p(s) is rnonic and Hurwitz with degree n 


with respect to such computational real time delays can be considerably 
improved by using a hybrid adaptive law for parameter estimation. The 
sampling rate of the hybrid adaptive law may be chosen appropriately to 
allow for the computations of the control law to be completed within the 
sampling interval. 

Let T m be the maximum time for performing the computations required 
to calculate the control law. Then the sampling period T s of the hybrid 
adaptive law may be chosen as T s = t k+ i — t k > T m where { t k : k = 
1,2,.. .} is a time sequence. Table 7.7 presents a hybrid adaptive law based 
on parametric model (7.4.2). It can be used to replace the continuous-time 
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Table 7.8 Hybrid APPC scheme: polynomial approach 


Plant 

£ 

II 

S3 JNI 

S3 S3 

£5 

S3 

Reference signal 

Qm{s)y m — 0 

Adaptive law 

Hybrid adaptive law of Table 7.7 

Algebraic 

equation 

Solve for L(s, tk) = s n ~ 1 +l T (tk)a n - 2 {s) 

P(s,tk) = p T {t k )a n+q -i (s) from equation 

L(s, tk')Q7n(s')H P {Si tk) T P(s, tk)Z p (s, tk) — A (s) 

Control law 

A (s) — L(s,tk)Qm.(s) P{s,tk) t ^ 

U P- A ( a ) U P A( s ) Wp ym > 

Design variables 

A* rnonic Hurwitz of degree 2ra + q — 1; Q m (s) 
rnonic of degree q with nonrepeated roots on the 
ju axis; A(s) = A p (s)A g (s); A p (s),A q (s) rnonic 
and Hurwitz with degree n, q — 1, respectively 


adaptive laws of the APPC schemes discussed in Section 7.4 as shown in 
Tables 7.8 to 7.10. 

The controller parameters in the hybrid adaptive control schemes of Ta- 
bles 7.8 to 7.10 are updated at discrete times by solving certain algebraic 
equations. As in the continuous-time case, the solution of these equations ex- 
ist provided the estimated polynomials R p (s, tk)Q m (s), Z p (s, tk) are strongly 
coprime at each time tk- 

The following theorem summarizes the stability properties of the hybrid 
APPC schemes presented in Tables 7.8 to 7.10. 

Theorem 7.5.1 Assume that the polynomials R p (s,tk)Qm(s), Z p (s,tk ) are 
strongly coprime at each time t = tk- Then the hybrid APPC schemes given 
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Table 7.9 Hybrid APPC scheme: state variable approach 


Plant 

£ 

II 

S3 Csj 

S3 S3 

ss 

S3 

Reference signal 

Qm{s)y m — 0 

Adaptive law 

Hybrid adaptive law of Table 7.7. 

State observer 

e = Ak-ie + Bk-iu p - A" 0 ( fc _i)[C T e - ei] A k _ i = 

In+q— 1 

“ 01(fc-l) By .- 1 = #2(fc— 1)> C T = 

0 

[1,0,..., 0] K 0 (k-i) = a *~ ^i(fc-i) 0i(fc-i)> ^2(fc-i) 
are the coefficient vectors of R p (s, tk)Q m (s) — s n+cl , 
Z p (s,t k )Qi(s), respectively, a* is the coefficient 
vector of A*(s) — s n+q 

Algebraic 

equation 

Solve for K c (k_\) the equation det [si — A k _i + 
Bk-iK c (k-i)} = A *(s) 

Control law 

i r > s .. Ql(s) 

u p — E-c(k— l) e > u p — p. / \ u p 

Cm. (S) 

Design variables 

Choose Q m , Qi, A*, A*, Q\ as in Table 7.5 


in Tables 7.8 to 7.10 guarantee signal boundedness and convergence of the 
tracking error to zero asymptotically with time. 

The proof of Theorem 7.5.1 is similar to that of the theorems in Sec- 
tion 7.4, with minor modifications that take into account the discontinuities 
in the parameters and is given in Section 7.7. 


Table 7.10 Hybrid adaptive LQ control scheme 
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Plant 

Z p (s) 

Vp= R P (s) Up 

Reference signal 

Qm(s)y m = 0 

Adaptive law 

Hybrid adaptive law of Table 7.7. 

State observer 

e = A k _ ie + B k -iUp - K 0 (k-i)[C T e - ei] 

K 0 (k- 1), A k- 1 , B k - 1 , C as in Table 7.9 

Riccati equation 

Solve for P k ~i = P k ~\ > 0 the equation 

A J_ yPk-i+Pk- \A k _ i~ | P k - \B k - \B \Pk-\+ c C T = 0 

Control law 

- 1 £,T D s Ql{s ) _ 

U P — x u p ~ n ( \ U P 

Design variables 

Choose X,Qi(s),Q m (s) as in Table 7.6 


The major advantage of the hybrid adaptive control schemes described in 
Tables 7.7 to 7.10 over their continuous counterparts is the smaller computa- 
tional effort required during implementation. Another possible advantage is 
better robustness properties in the presence of measurement noise, since the 
hybrid scheme does not respond instantaneously to changes in the system, 
which may be caused by measurement noise. 

7.6 Stabilizability Issues and Modified APPC 

The main drawbacks of the APPC schemes of Sections 7.4 and 7.5 is that 
the adaptive law cannot guarantee that the estimated plant parameters or 
polynomials satisfy the appropriate controllability or stabilizability condi- 
tion at each time t, which is required to calculate the controller parameter 
vector 8 C . Loss of stabilizability or controllability may lead to computational 
problems and instability. 
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In this section we concentrate on this problem of the APPC schemes and 
propose ways to avoid it. We call the estimated plant parameter vector 9 P at 
time t stabilizable if the corresponding algebraic equation is solvable for the 
controller parameters. Because we are dealing with time- varying estimates, 
uniformity with respect to time is guaranteed by requiring the level of sta- 
bilizability to be greater than some constant s* > 0. For example, the level 
of stabilizability can be defined as the absolute value of the determinant of 
the Sylvester matrix of the estimated plant polynomials. 

We start with a simple example that demonstrates the loss of stabiliz- 
ability that leads to instability. 

7.6.1 Loss of Stabilizability: A Simple Example 

Let us consider the first order plant 


y = y + bu (7.6.1) 

where b ^ 0 is an unknown constant. The control objective is to choose u 
such that y,u£ Too, and y(t) — > 0 as t — > oo. 

If b were known then the control law 

u = -\y (7.6.2) 

b 

would meet the control objective exactly. When b is unknown, a natural 
approach to follow is to use the certainty equivalence control (CEC) law 

u c = -j y (7.6.3) 

b 

where b{t ) is the estimate of b at time t, generated on-line by an appropriate 
adaptive law. 

Let us consider the following two adaptive laws: 

(i) Gradient 

b = 7 4>e , 6(0) = 6 o / 0 (7.6.4) 

where 7 > 0 is the constant adaptive gain. 
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(ii) Pure Least-Squares 


b = P<j)E , 6(0) = 6o / 0 

p = - p 2 rw- = 

where P, 0 £ 1Z 1 , 


£ 


z 


z — 60 1 

l + M 2 ’ Vf = J+\ y ' 

Vf-Vf 


s + 1 


(7.6.6) 


It can be shown that for (3q > 0, the control law (7.6.3) with 6 generated 
by (7.6.4) or (7.6.5) meets the control objective provided that 6(f) / 0 Vf > 0. 

Let us now examine whether (7.6.4) or (7.6.5) can satisfy the condition 
6(f) / 0, Vf > 0. 

From (7.6.1) and (7.6.6), we obtain 


60 

1 + /3 O 0 2 


(7.6.7) 


where 6 = 6 — 6 is the parameter error. Using (7.6.7) in (7.6.4), we have 

b= ’ S(0) = So (7.6.8) 

Similarly, (7.6.5) can be rewritten as 


b = -P 


m = 


1 + /3 O 0 2 

Po 




( 6 - 6 ), 6 ( 0 ) = 6 0 

Po > 0 


(7.6.9) 


It is clear from (7.6.8) and (7.6.9) that for 6(0) = 6, 6(f) = 0 and 6(f) = 
6, Vf > 0; therefore, the control objective can be met exactly with such an 
initial condition for 6. 

If 0(f) = 0 over a nonzero finite time interval, we will have 6 = 0, u = 
y = 0, which is an equilibrium state (not necessarily stable though) and the 
control objective is again met. 
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Figure 7.5 Output y(t) versus estimate 6(f) for different initial conditions 
y( 0) and 6(0) using the CEC u c = —2y/b. 

For analysis purposes, let us assume that 6 > 0 (unknown to the designer). 
For 4> / 0, both (7.6.8), (7.6.9) imply that 

sgn(6) = — sgn(6(f) — b) 

and, therefore, for b > 0 we have 

6(f) > 0 if 6(0) < 6 and 6(f) < 0 if 6(0) > 6 

Hence, for 6(0) < 0 < 6, 6(f) is monotonically increasing and crosses zero 

leading to an unbounded control u c . 

Figure 7.5 shows the plots of y(t) vs 6(f) for different initial conditions 
6(0), y( 0), demonstrating that for 6(0) < 0 < 6, 6(f) crosses zero leading to 
unbounded closed-loop signals. The value of 6 = 1 is used for this simulation. 

The above example demonstrates that the CEC law (7.6.3) with (7.6.4) 
or (7.6.5) as adaptive laws for generating 6 is not guaranteed to meet the 
control objective. If the sign of 6 and a lower bound for |6| are known, 
then the adaptive laws (7.6.4), (7.6.5) can be modified using projection to 
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constrain b(t) from changing sign. This projection approach works for this 
simple example but its extension to the higher order case is awkward due to 
the lack of any procedure for constructing the appropriate convex parameter 
sets for projecting the estimated parameters. 

7.6.2 Modified APPC Schemes 

The stabilizability problem has attracted considerable interest in the adap- 
tive control community and several solutions have been proposed. We list 
the most important ones below with a brief explanation regarding their ad- 
vantages and drawbacks. 

(a) Stabilizability is assumed. In this case, no modifications are introduced 
and stabilizability is assumed to hold for all t > 0. Even though there is no 
theoretical justification for such an assumption to hold, it has been often 
argued that in most simulation studies, no stabilizability problems usually 
arise. The example presented above illustrates that no stabilizability prob- 
lem would arise if the initial condition of 6(0) happens to be in the region 
6(0) > 6. In the higher order case, loss of stabilizability occurs at certain 
isolated manifolds in the parameter space when visited by the estimated 
parameters. Therefore, one can easily argue that the loss of stabilizability 
is not a frequent phenomenon that occurs in the implementation of APPC 
schemes. 

(b) Parameter projection methods [73, 109, 111]. In this approach, the adap- 
tive laws used to estimate the plant parameters on-line are modified using 
the gradient projection method. The parameter estimates are constrained 
to lie inside a convex subset Co of the parameter space that is assumed to 
have the following properties: 

(i) The unknown plant parameter vector 6 * € Co- 

(ii) Every member 9 P of Co has a corresponding level of stabilizability greater 

than e* for some known constant e* > 0. 

We have already demonstrated this approach for the scalar plant 
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In this case, the estimated polynomials are s + a,b which, for the APPC 
schemes of Section 7.4 to be stable, are required to be strongly coprime. 
This implies that b should satisfy \b(t)\ > bo for some bo > 0 for all t > 0. 
The subset Co in this case is defined as 

Co = {& G 'R}\ bsgn(b) > &o| 

where the unknown b is assumed to belong to Co, i.e. , |6| > bo- As shown in 
Examples 7.4.1 and 7.4.2, we guaranteed that \b(t)\ > bo by using projection 
to constrain b(t) to be inside Co Vi > 0. This modification requires that bo 
and the sgn{b ) are known a priori. 

Let us now consider the general case of Sections 7.4 and 7.5 where the 
estimated polynomials R p (s, t)Q m (s), Z p (s, t ) are required to be strongly co- 
prime. This condition implies that the Sylvester matrix S e (9 p ) of R p (s,t ) 
Qm(s), Z p (s,t) satisfies 

| det5 e (0 p )| > e* 

where 9 P G IZ 2n is the vector containing the coefficients of R p (s,t ) — s n and 
Z p (s,t), and e* > 0 is a constant. If e* > 0 is chosen so that 

|det«S c (0p)| > e* > 0 

where 9* G IZ 2n is the corresponding vector with the coefficients of the 
unknown polynomials R p (s), Z p (s), then the subset Co may be defined as 

Co = convex subset of V G lZ 2n that contains 9* 

where 

V=\9 p e Il 2n \ | detS e (0 p )| > e* > o} 

Given such a convex set Co, the stabilizability of the estimated parameters 
at each time t is ensured by incorporating a projection algorithm in the 
adaptive law to guarantee that the estimates are in Co, Vi > 0. The projection 
is based on the gradient projection method and does not alter the usual 
properties of the adaptive law that are used in the stability analysis of the 
overall scheme. 

This approach is simple but relies on the rather strong assumption that 
the set Co is known. No procedure has been proposed for constructing such 
a set Co for a general class of plants. 

An extension of this approach has been proposed in [146] . It is assumed 
that a finite number of convex subsets C\, , C p are known such that 
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(i) 9* £ O p i=1 Ci and the stabilizability degree of the corresponding plant is 

greater than some known e* > 0. 

(ii) For every 9 P £ U ? =1 Cj the corresponding plant model is stabilizable with 

a stabilizability degree greater than e* . 

In this case, p adaptive laws with a projection, one for each subset C*, 
are used in parallel. A suitable performance index is used to select the 
adaptive law at each time t whose parameter estimates are to be used to 
calculate the controller parameters. The price paid in this case is the use 
of p parallel adaptive laws with projection instead of one. As in the case 
of a single convex subset, there is no effective procedure for constructing 
Ci, i = 1,2 ,...,p, with properties (i) and (ii) in general. The assumption, 
however, that 9* £ U p i=1 Ci is weaker. 

(c) Correction Approach [40]. In this approach, a subset V in the parameter 
space is known with the 

following properties: 

(i) 9* £ D and the stabilizability degree of the plant is greater than some 

known constant e* > 0 . 

(ii) For every 9 p £ V, the corresponding plant model is stabilizable with a 

degree greater than e* . 

Two least-squares estimators with estimates 9 Pl 9 P of 9* are run in par- 
allel. The controller parameters are calculated from 9 P as long as 9 P £ V. 
When 9 p 0 V, 9 P is reinitialized as follows: 

0 P = 9 P + P 1 / 2 7 

where P is the covariance matrix for the least-squares estimator of 9*. and 
7 is a vector chosen so that 9 P £ V. The search for the appropriate 7 can be 
systematic or random. 

The drawbacks of this approach are (1) added complexity due to the 
two parallel estimators, and ( 2 ) the search procedure for 7 can be tedious 
and time-consuming. The advantages of this approach, when compared with 
the projection one, is that the subset V does not have to be convex. The 
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importance of this advantage, however, is not clear since no procedure is 
given as to how to construct V to satisfy conditions (i), (ii) above. 

(d) Persistent excitation approach [17, 49]. In this approach, the reference 
input signal or an external signal is chosen to be sufficiently rich in frequen- 
cies so that the signal information vector is PE over an interval. The PE 
property guarantees that the parameter estimate 0 p of 9* converges expo- 
nentially to 6* (provided the covariance matrix in the case of least squares 
is prevented from becoming singular). Using this PE property, and by as- 
suming that a lower bound e* > 0 for the stabilizability degree of the plant 
is known, the following modification is used: When the stabilizability de- 
gree of the estimated plant is greater than s*, the controller parameters 
are computed using 9 P ; otherwise the controller parameters are frozen to 
their previous value. Since 9 p converges to 6*, the stabilizability degree of 
the estimated plant is guaranteed to be greater than e* asymptotically with 
time. 

The main drawback of this approach is that the reference signal or ex- 
ternal signal has to be on all the time, in addition to being sufficiently rich, 
which implies that accurate regulation or tracking of signals that are not rich 
is not possible. Thus the stabilizability problem is overcome at the expense 
of destroying the desired tracking or regulation properties of the adaptive 
scheme. Another less serious drawback is that a lower bound e* > 0 for the 
stabilizability degree of the unknown plant is assumed to be known a priori. 

An interesting method related to PE is proposed in [114] for the sta- 
bilization of unknown plants. In this case the PE property of the signal 
information vector over an interval is generated by a “rich” nonlinear feed- 
back term that disappears asymptotically with time. The scheme of [114] 
guarantees exact regulation of the plant output to zero. In contrast to other 
PE methods [17, 49], both the plant and the controller parameters are esti- 
mated on-line leading to a higher order adaptive law. 

(e) The cyclic switching strategy [166] . In this approach the control input is 
switched between the CEC law and a finite member of specially constructed 
controllers with fixed parameters. The fixed controllers have the property 
that at least one of them makes the resulting closed-loop plant observable 
through a certain error signal. The switching logic is based on a cyclic 
switching rule. The proposed scheme does not rely on persistent excitation 
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and does not require the knowledge of a lower bound for the level of stabi- 
lizability. One can argue, however, that the concept of PE to help cross the 
points in the parameter space where stabilizability is weak or lost is implic- 
itly used by the scheme because the switching between different controllers 
which are not necessarily stabilizing may cause considerable excitation over 
intervals of time. 

Some of the drawbacks of the cyclic switching approach are the complex- 
ity and the possible bad transient of the plant or tracking error response 
during the initial stages of adaptation when switching is active. 

(f) Switched- excitation approach [63]. This approach is based on the use 
of an open loop rich excitation signal that is switched on whenever the 
calculation of the CEC law is not possible due to the loss of stabilizability of 
the estimated plant. It differs from the PE approach described in (d) in that 
the switching between the rich external input and the CEC law terminates 
in finite time after which the CEC law is on and no stabilizability issues arise 
again. We demonstrate this method in subsection 7.6.3. 

Similar methods as above have been proposed for APPC schemes for 
discrete-time plants [5, 33, 64, 65, 128, 129, 189, 190]. 


7.6.3 Switched-Excitation Approach 

Let us consider the same plant (7.6.1) and control objective as in subsec- 
tion 7.6.1. Instead of the control law (7.6.3), let us propose the following 
switching control law 


u = 


-2 y/h if t G [0, ti) U (t k + j k r , t k+ 1) 
c if t e [t k ,t k + j k r\ 


(7.6.10) 


where c / 0 is a constant. 


b = P(j)£ , 6(0) = 6q / 0 


(7.6.11) 


P 2 cj) 2 
1 + M 2 ’ 


P( 0 ) 


P(t k ) 


P(t k + jr) = Pol > o, j 


1) 2, . . , , j k 


/3o 


l i£te[0,ti)u(t k +j k T,t k+ i) 

0 if t G [t k ,t k + j k r\ 


where e, cj) are as defined in (7.6.6), and 
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• k = 1 , 2 ,... 

• t\ is the first time instant for which 

|6(fi)| = z/(l) 


where 



Ki) 

k 


k = 1 , 2 ,... 


• tk is the first time instant after t = t k ~ i + jk- i T for which 


\b{t k )\ = v{k + Y^Ji) = — 7- 

i = 1 k + Mi=l Ji 

where j k = 1 , 2 ,... is the smallest integer for which 

k 

\b{t k +j k T)\ > v(k + ^2,ji) 
i = 1 


and r > 0 is a design constant. 


Even though the description of the above scheme appears complicated, 
the intuition behind it is very simple. We start with an initial guess bo ^ 0 
for b and apply u = u c = —2 y/b. If \b(t)\ reaches the chosen threshold 
i/(l) = |&o|/2, say at t = t\, u switches from u = u c to u = u r = c / 0, 
where u r is a rich signal for the plant considered. The signal u r is applied for 
an interval r, where r is a design constant, and \b(t\ + t)| is compared with 
the new threshold u{2) = z/(l)/2. If \b(ti + r)\ > z/(2), then we switch back to 
u = u c at t = t\ + r. We continue with u = u c unless 1 6(^2) | = z/(3) = 
for some finite t -2 in which case we switch back to u = u r . If \b(t\ + r)| < z/(2), 
we continue with u = u r until t = fi + 2r and check for the condition 

|&(f! + 2r)| > 1/(3) = ^ (7.6.12) 


If 6(ti + 2r) satisfies (7.6.12), we switch to u = u c and repeat the same 
procedure by reducing the threshold v(k) at each step. If b(t\ + 2 r) does 
not satisfy (7.6.12) then we continue with u = u r for another interval r and 
check for 


\b(h + 3r)| > z/(4) 


*/(!) 


4 
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and repeat the same procedure. We show that the sequences 6fc, v(k) con- 
verge in a finite number of steps, and therefore |6(6)| > v* > 0 for some 
constant v* and u = u c for all 6 greater than some finite time 6*. 

In the sequel, for the sake of simplicity and without loss of generality, 
we take c = 1,pqt = 1 and adjust the initial condition 4>(tk) for the filter 
4> = to be equal to <j)(tk) = 1 so that 4>(t) = 1 for i G [^,^4 jkj] and 
\/k > 1. 

Let us start with a “wrong” initial guess for 6(0), i.e. , assume that 6 > 0 
and take 6(0) < 0. Because 

b = -P(t> 2 (b - 6)/(l + M 2 ) 

and P(t) > 0, for any finite time 6, we have that 6 > 0 for b(t) < 6 where 
6(6) = 0 if and only if cp(t) = 0. Because 4>(t) = 1 V6 > 0, 6(6), starting from 
6(0) < 0, is monotonically increasing. As 6(6) increases, approaching zero, it 
satisfies, at some time 6 = 6i, 

6(6i) = -i/(l) = — 16 0 |/2 


and, therefore, signals the switching of the control law from u = u c to u=u r , 
i.e., for 6 > 6i, we have 

u = u r = 1 

P = -P 2 4>\ P(h)= Po (7.6.13) 

b= -P<p 2 {b-b) (7.6.14) 

The solutions of (7.6.13) and (7.6.14) are given by 


m = 
6 ( 6 ) = 6 + 


Po 


1 +Po(t - 6i) 

1 


, 6 > 6i 


(6(6i) - 6), 6 > 6i 


1 +Po(t - 6i) 

We now need to monitor 6(6) at 6 = 6i + j\T. j i = 1,2,... until 


|6(6i + jir)| 


v(l ) + b 

1 + ji 


u(l) 


> 


1 + 3 1 


(because pqt = 1) (7.6.15) 
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is satisfied for some j\ and switch to u = u c at t = H + j\T. We have 

\b(h + jxT)\ = ^ (7.6.16) 

1 + Ji 

Let jl be the smallest integer for which bj\ > zz(l). Then, Vji > j*. condition 
(7.6.15) is the same as 

\b{ti + h T )\ = Kh+hr) = — (7.6.17) 

1+Ji 1 + 3\ 

Hence, for j\ = 2 j*, (7.6.17) is satisfied, i.e., by applying the rich signal 
u = u r = 1 for 2 j* intervals of length r, b(t) passes through zero and 
exceeds the value of u* = zz(l)/(l + 2 j*) > 0. Because b(t) > 0, we have 
b(t) > v* , \/t > t\ + 2jlr and therefore u = u c = —2y/b, without any further 
switching. 

Figure 7.6 illustrates typical time responses of y. u and b when the 
switched-excitation approach is applied to the first order plant given by 
(7.6.1). The simulations are performed with 6=1 and 6(0) = —1.5 . At 
t = t\ « 0.2s, 6(H) = i^(l) = 6(0)/2 = —0.75, the input u = u c = —2y/b 
is switched to u = u r = 1 for a period r = 0.25s. Because at time t = 
t\ +r, 6(t i +t) is less than v(2) = l)/2, u = u r = 1 is applied for another 

period r. Finally, at time t = H + 2 r, 6(t) > u(3) = i/(l)/3, therefore u 
is switched back to u = u c . Because b{t) > zz(l)/3, Vt > 0.7s, no further 
switching occurs and the exact regulation of y to zero is achieved. 


General Case 

The above example may be extended to the general plant (7.3.1) and control 
objectives defined in the previous subsections as follows. 

Let u c denote the certainly equivalence control law based on any one of 
the approaches presented in Section 7.4. Let 

C d (6 p ) = det|S e (0 p )| 
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Figure 7.6 The time response of the output y(t), control input u(t) and 
estimate b(t) for the switched-excitation approach. 


where S e (9 p ) denotes the Sylvester matrix of the estimated polynomials 
R p (s,t) Qm(s), Z p (s,t ) and 9 p (t ) is the vector with the plant parameter 
estimates. 

Following the scalar example, we propose the modified control law 


u = 


u c (t) if te [0, ti) U (t k + jkT, tk+i) 
u r (t ) if t e [t k ,t k + j k r\ 


( 7 . 6 . 18 ) 
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Adaptive Law 


P = 


9 P = Peel), 

P<M> T P 
l + M T (j)' 


0 P (o) = e 0 


p{ o) = p 0 = p 0 t > o 


where 

e = (z- dj (t>) / (l + 4>) 

and 6*0 is chosen so that 0^(0 o) > 0. Furthermore, 

-P(O) = P(tfe) = + jr) = k^ 1 1, j = 1,2,... ,j k 


where &o = constant > 0, and 


A) 


l if t e [0, ti) u (t k + jkT,t k +i) 

0 if t <E [t k ,t k + j k r\ 


(7.6.19) 


(7.6.20) 


(7.6.21) 


where 

• k = 1 , 2 ,... 

• t\ is the first time instant for which Cd(9 p (ti)) = i/(l) > 0, where 

v{ ' k) = Cd ^r = vi k L ^ = ( 7 - 6 - 22 ) 

• t k (k > 2) is the first time instant after t = t k _\ + j k —\T for which 


C d (9 p (t k )) = ^ + E 3i) = n ■ 7 ( 7 - 6 - 23 ) 

t=l + Ai=i 3i) 

and j k = 1 , 2 ,... is the smallest integer for which 

k 

Cd(0 p (t k + j k r)) > v(k + EE) (7.6.24) 

i=l 

where r > 0 is a design constant. 

• u c (t) is the certainty equivalence control given in Section 7.4. 

• u r (t) is any bounded stationary signal which is sufficiently rich of order 
2 n. For example, one can choose 

n 

u r (t) = E A * sincujt 
i= 1 
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where A; / 0, i = 1, . . . , n, and u / Uj for i / j. 

From (7.6.18), we see that in the time intervals ( tk + jkT, tfe+i), the sta- 
bilizability degree Cd(0 p (t )) is above the threshold u(k + J2i=iji) and the 
adaptive control system includes a normalized least-squares estimator and 
a pole placement controller. In the time intervals [tk+i, ifc+i + jk+\ T ], the 
control input is equal to an external exciting input u r {t) and the parameter 
vector estimate 9 p (t ) is generated by an unnornralized least-squares estima- 
tor. 

The switching (at time t = tk) from the pole placement control u c to the 
external rich signal u r occurs when the stabilizability degree Cd(9 p (t )) of the 
estimated model reaches the threshold v{k+J2i = l Ji)- We keep applying u = 
u r during successive time intervals of fixed length r, until time t = tk + jkT 
for which the condition Cd{0 p {tk + jkT)) > v(k + Y)%= l ji) is satisfied and u 
switches back to the CEC law. The idea behind this approach is that when 
the estimated model is stabilizable, the control objective is pole placement 
and closed-loop stabilization; but when the estimation starts to deteriorate, 
the control priority becomes the “improvement” of the quality of estimation, 
so that the estimated parameters can cross the hypersurfaces that contain 
the points where Cd{0 p ) is close to zero. 

The following theorem establishes the stability properties of the proposed 
adaptive pole placement scheme. 

Theorem 7.6.1 All the signals in the closed-loop (7.3.1) and (7.6.18) to 
(7.6.24) are bounded and the tracking error converges to zero as t — > 00 . 
Furthermore, there exist finite constants u* ,T* >0 such that for t > T* , we 
have Cd{0 p (t)) > u* and u(t) = u c . 

The proof of Theorem 7.6.1 is rather long and can be found in [63]. 

The design of the switching logic in the above modified controllers is 
based on a simple and intuitive idea that when the quality of parameter es- 
timation is “poor,” the objective changes from pole placement to parameter 
identification. Parameter identification is aided by an external open-loop 
sufficiently rich signal that is kept on until the quality of the parameter es- 
timates is acceptable for control design purposes. One of the advantages of 
the switched-excitation algorithm is that it is intuitive and easy to imple- 
ment. It may suffer, however, from the same drawbacks as other switching 
algorithms, that is, the transient performance may be poor during switching. 
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The adaptive control scheme (7.6.18) to (7.6.24) may be simplified when a 
lower bound v* > 0 for Cd(0*) is known. In this case, v(k) = v* Vk. In the 
proposed scheme, the sequence v{k) converges to v* and therefore the lower 
bound for Cd(0*) is also identified. The idea behind the identification of v* 
is due to [189] where very similar to the switched-excitation approach meth- 
ods are used to solve the stabilizability problem of APPC for discrete-time 
plants. 


7.7 Stability Proofs 

In this section we present all the long proofs of the theorems of the previous sub- 
sections. In most cases, these proofs follow directly from those already presented 
for the simple examples and are repeated for the sake of completeness. 

7.7.1 Proof of Theorem 7.4.1 

Step 1 . Let us start, by establishing the expressions (7-4.24). We rewrite the control 
law (7.4.23) and the normalized estimation error as 


LQm — P ^ (jjp Vm) 

(7.7.1) 

= z — 0 p (j) = R p -—y p — Zp — Up 

1Y V 4Vr> 

(7.7.2) 


where A(s), A p (s) are monic, Hurwitz polynomials of degree n+q—1, n, respectively, 
R p = s n +0j a„_i(s), Z p = Oj a„_i(s). From Table 7.4, we have A(s) = A p (s)A g (s), 
where A 9 (s) is a monic Hurwitz polynomial of degree q—1. This choice of A simplifies 
the proof. We should point out that the same analysis can also be carried out with 
A, A p being Hurwitz but otherwise arbitrary, at the expense of some additional 
algebra. 

Let us define 

A 1 A 1 

u f = Vf = jVv 

and write (7.7.1), (7.7.2) as 

Py f A LQ m Uf Vinli RpA q y f ZpAq'Uf 677 7. (7.7.3) 

where y m i = P\y m € C^. By expressing the polynomials R p (s)A q (s), Z p (s)A q (s), 
P(s), L(s)Q m (s) as 

R p (s)A q (s) = s n+9-1 + dj a n+q - 2 (s), Z p (s)A q (s) = Oj a n+q - 2 (s) 
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P(s) = p 0 s n+q 1 + p T a n+q - 2 (s), L(s)Q m (s) = s n+q 1 + F a n+q - 2 (s ) 
we can rewrite (7.7.3) in the form of 

yO+q-i ) _ _gj a n+q - 2 (s)yf + dj a n+q - 2 (s)u f + em 2 

u (n+q-l) _ _ py an+q _ 2 ^ y f _ ( po 0 2 +I) T a n+q _ 2 ( S ) U f (7.7.4) 

-p 0 em 2 + ymi 

where the second equation is obtained from (7.7.3) by substituting for y^ l+q F 
Defining the state x = y^ n+q 2 \ ■ ■ ■ ,Vf, Uf, ity n+9 2 \ • • • , we obtain 

x = A(t)x + b l {t)em 2 +b 2 y ml (7.7.5) 

where 

-oj I ej 

o 

In+q-2 ; I 0( n+q _ 2)x(n+g-l) 

0 

PoOj -p 1 | -p 0 9j ~F 

0 

0(n+q-2)x(n+q-l) I In+q-2 ; 

0 

0 („+q_ 2 ) X (n+ 9 -i) is an (n + q — 2) by (?i + q — 1) matrix with all elements equal 
to zero. Now, because u p = A Uf = Fj- l+q ^ + \ T a n + q - 2 (s)uf, y p = Ayj = 

y < f l+q ~ 1 ' > + A T a n+q _ 2 (s)yf where A is the coefficient vector of A(s) — s n+q ~ 1 , we 
have 

u p = [0, . . . ,0, 1 ,0, „ . , Oj x + [0, . . . , 0, A t ]x 

n-\-q— 1 n-\-q— 1 n-\-q— 1 

y p = [l,0,...,q ,0,...,0ja;+[A T ,0,.. Oja; (7.7.6) 

n-\-q— 1 n-\-q— 1 n-\-q— 1 

Step 2. Establish the e.s. of the homogeneous part of (7.7 .5). Because P,L 
satisfy the Diophantine equation (7.4.21), we can show that for each frozen time 1 t, 

det (si - A(t)) = R p A q ■ LQ m + P ■ Z p A q = A* A q (7.7.7) 

1 X ■ Y denotes the algebraic product of two polynomials that may have time- varying 
coefficients. 
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i.e., A(t) is a stable matrix for each frozen time t. One way to verify (7.7.7) is to 
consider (7.7.3) with the coefficients of R p , Z p , P, L frozen at each time t. It follows 


from (7.7.3) that 

Vf = 


1 


{L(f rn ern + ZpAqy m i ) 


RpAq ' LQ m T P ■ ZpAq 
whose state space realization is given by (7.7.5). Because 

RpAq ■ LQm + P ' ZpAq = A* Aq 


(7.7.7) follows. 

We now need to show that ||A(t)|| £ £ 2 ,||A(t)|| £ Coo from the properties 
9 P £ £oo and 9 P £ C 2 which are guaranteed by the adaptive law of Table 7.4. Using 
the assumption that the polynomials R p Q m and Z p are strongly coprime at each 
time t, we conclude that the Sylvester matrix Si (defined in Table 7.4) is uniformly 
nonsingular, i.e., |det(S/)| > vq for some vq > 0, and thus 9 P £ C oo implies that 
Si, Sf 1 G L-oq. Therefore, the solution $1 of the algebraic equation Si$i = a * which 
can be expressed as 

Pi = 

is u.b. On the other hand, because 9 P £ Coo and 9 p £ £ 2 , it follows from the 
definition of the Sylvester matrix that ||5/(t)|| £ £ 2 . Noting that 

hi = -s-'bis- 1 ^ 

we have /3 t £ £ 2 which is implied by £ Coo and ||«S/(t)|| £ C 2 - 

Because the vectors 9-\ , 62 , p. I are linear combinations of 9 P , /3; and all elements 
in A(t) are uniformly bounded, we have 

\\Am<c(\'m\ + \9 P m 

which implies that ||zl(t)|| £ C 2 . Using Theorem 3.4.11, it follows that the homo- 
geneous part of (7.7.5) is e.s. 

Step 3. Use the properties of the C 2 s norm and B-G Lemma to establish 
boundedness. As before, for clarity of presentation, we denote the C 2 $ norm as || • ||, 
then from Lemma 3.3.3 and (7.7.5) we have 

||x|| < c||em 2 || + c, \x(t)\ < c\\em 2 \\ + c (7.7.8) 

for some S > 0. Defining rrij = 1 + ||y p || 2 + ||w p || 2 , it follows from (7.7.5), (7.7.6), 

(7.7.8) and Lemma 3.3.2 that (f>,m< mf and 

m 2 < c + c||x|| 2 + c||em 2 || 2 < c||em?n/|| 2 + c 


(7.7.9) 



516 


CHAPTER 7. ADAPTIVE POLE PLACEMENT CONTROL 


i.e., 

mj<c + c f e _1 ' (t_r ' ) e 2 (r)m 2 (T)?Ti^(T)dr 
Jo 

Because em € C 2 , guaranteed by the adaptive law, the boundedness of m/ can be 
established by applying the B-G Lemma. The boundedness of the rest of the signals 
follows from mj £ and the properties of the C 2b norm that is used to show that 
m.f bounds most of the signals from above. 

Step 4. Establish that the tracking error converges to zero. The tracking 
properties of the APPC scheme are established by manipulating (7.7.1) and (7.7.2) 
as follows: From the normalized estimation error equation we have 

2 Ai _ A 1 

cm — Rp Up Zp Up 

i\p i\p 


Filtering each side of the above equation with LQ m where L(s, t) = s n 1 +a^ l _ 2 (s)l 
and l c = [1,/ T ] T is the coefficient vector of L(s,t ), it follows that 

I-Qm ^ = LQmj- - Z p ^-u^j (7.7.10) 


Noting that A = A q A p , and applying the Swapping Lemma A.l, we obtain the 
following equations: 


Qm r. 1 


Qn 


Qn 


A q P A P 


\ Rp ^ Up — Rp ^ Up + r i? A ( Z p A U P — Zp A Up + r 2 (7.7.11) 


A q V "Rp 


Qn 


A 


where 


A 


I-iM 


A 


r 1 = W cl (s) ( ( W bl (s) i' y P 0 a ,r 2 = IU cl (s) ( ( W bl {s) A u p | u b 


Q^-i( s ) . 
A. n 


and W c i,W b i are as defined in Swapping Lemma A.l with W = Because 

u p ,y p € £oo and 9 ai 0 b € C 2 , it follows that ri,r 2 £ C 2 . Using (7.7.11) in (7.7.10), 
we have 


LQ m -^-(em 2 ) = L (jtp^-yp - Z p ( ^-u p + rq - r 2 ^ (7.7.12) 

Noting that Q m yp = Qm(e 1 + Vm) = 1 , we can write (7.7.12) as 

LQr„ l Urn 2 ) = L ei + n - r 2 ) - L (z p Q ™u^j (7.7.13) 
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Applying the Swapping Lemma A. 4 (i) to the second term in the right-hand side of 
(7.7.13), we have 

L = Z. p + r 3 (7.7.14) 

(by taking ^«j, = / in Lemma A. 4 (i)), where Z p = a^_ 1 (s)9b and 

r 3 = a^_ 2 A(Z,6» 6 )a n _ 2 (s)^-Wp 

A(s) 

where ||F(/,^)|| < ci\l\ + C2\0b\ for some constants ci,C 2 > 0. Because 

Z/(5,t)Q m (s) L(s,t) • Qm(s) 
we use the control law (7.7.1) to write 

T( ,\ Qm(^) D ^ /« 7 1 r\ 

= i(M) ■ -a 5T“- = - p Xw ei ( ’ 

Substituting (7.7.15) in (7.7.14) and then in (7.7.13), we obtain 

LQ m ^-(em 2 ) = L (^X[s) 6l ) + ^ ri - r2 ^ “ r3 ( 7 - 7 - 16 ) 

According to Swapping Lemma A. 4 (ii) (with f = \e \ and Aq(s) = 1), we can 


write 




— 7/(<S, £) Rp(s, t)Qm(s) . , x Ci H - 7*4 

A \s) 


Zr ( F A(,) ei ) 

- Z p (s,t) ■ P(s,t) e i + r 5 

A(s) 

(7.7.17) 

where 



r 4 = aJ_ 1 (s)G(s,e 1 . 

.1,0a), r 5 = a n-l( s )U' , ( s , e i,9 b ,p) 

(7.7.18) 


and G(s,ei,l,9 a ),G (s, ei, 9b, p) are defined in the Swapping Lemma A. 4 (ii). Be- 
cause e\ G Coo and P, 9 a , 9b G £ 2 , it follows from the definition of G,G that 
G,G' G C- 2 - Using (7.7.17) in (7.7.16) we have 

LQ m ^-{em 2 )=[L{s,t) ■ R p (s,t)Q m (s) + Z p ■ P(s,t)j ^yei + Z(ri-r- 2 )-r 3 -|-r- 4 -l-r 5 

(7.7.19) 

In the above equations, we use X(s, t) to denote the swapped polynomial of A(s, t), 
i.e., 

X(s,t) = pj (t)a n (s), X(s,t ) = aj(s)p x (t) 
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Because 


we have 


L(s, t) ■ R p (s, t)Q m (s) + Z p (s , <) • P(s, t) = A*(s) 


L(s, t) ■ R p (s, t)Q m (s ) + Z p (s, t ) • P(s, f) = A*(s) = A*(s) 


where the second equality holds because the coefficients of A*(s) are constant. 
Therefore, (7.7.19) can be written as 

■ 4 ’ W A5) ei = ’’ 


where 


. - A ( fi ) . 

Sl A*{s) 


v = L[ Q m ——(em 2 ) - n + r 2 ) + r 3 - r 4 - r 5 

V A «(s) J 


Because L(s) = a^_ 1 (s)/ c , l c = [1,Z T ] T we have 


v = a^_ 1 (s)Z c [^ A (em 2 ) - n + r 2 ] + r 3 - r 4 - r 5 


Therefore, it follows from (7.7.20) that 


(7.7.20) 


A(s) T / \, Qm(s) / 2 , i 

ei = 3y7)“"- i(s)i 'iyw <e ' n)+ ’' 0 


(7.7.21) 


where 


P) = ^^y[aI_i(s)Z c (r 2 -ri) + a^_ 2 (s)ui-a^„ 1 (s)u 2 ] 

Vi = F(l, 0 b )a n - 2 (s) Up 

A (s) 

v 2 = G(s, ei,l,9 a ) +G (s,ei,p, 9 b ) 


A s ) a 0-i( s ) a„- 2 (s)Q„ 1 (s) 


Because — — , " Ap,) 1 are strictly proper and stable, l c € Coo, and 
f\ ,r 2 ,v\, v 2 € C 2 , it follows from Corollary 3.3.1 that vq € C 2 and vq ( t) — > 0 as 
t — > oo. 

Applying the Swapping Lemma A.l to the first term on the right side of (7.7.21) 
we have 


ei = ( 


T a n-l(s)A(s)Q m (s) 2 


A*(s)A 9 (s) 


(era ) + W c (s)(W b (s)(em ))l c + v 0 
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where W c (s),Wb(s) have strictly proper stable elements. Because <Xr ' A l A mA is 
proper, l c , an £ £ oo fj £2 and zjq £ £00 fl £ 2 , it follows that e\ £ C^ f) £ 2 . 

Zi 

The plant equation y p = ^ 7u v assumes a minimal state space representation of 
the form 

Y = AY + Bu p 
y v = C t y 

where ( C , A) is observable due to the coprimeness of Z p , R p . Using output injection 
we have 

Y = (A — KC t )Y + Bu p + Ky p 

where K is chosen so that A co = A — KC T is a stable matrix. Because y p , u p £ £oo 
and A co is stable, we have Y £ £oo, which implies that Y, y p £ Coq. Therefore, 
e-i = y p — y m £ Coo, which, together with e\ £ £2 and Lemma 3.2.5, implies that 
ei(i) — > 0 as t — > 00 . 


7.7.2 Proof of Theorem 7.4.2 


Step 1. Develop the closed-loop state error equations. We start by representing 
the tracking error equation 


ei 


Z P Qi 

RpQn 


in the following state-space form 


e = 


In+q—l 

0 


e + 0* 2 u p , e £ TZ n+q 


(7.7.22) 


d = C T e 

where C T = [1, 0, . . . , 0] £ lZ n+q and 9\, 0 2 are the coefficient vectors of R p Q m — 
s n+q , Z p Qi, respectively. 

Let e 0 = e — e be the state observation error. Then from the equation for e in 
Table 7.5 and (7.7.22), we have 

e = A c (t)e + K 0 C T e 0 

e Q = A 0 e 0 + ei — 02Up (7.7.23) 


where 



* 

—a 

In-\-q— 1 



0 
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is a stable matrix; A c (t) = A — BK C and 9\ = 9\ — 9(, 62 = 62 — 0 %. The plant input 
and output satisfy 


y P 

Up 


C T e 0 + C T e + y m 


RpXQi _ QiYRp 

u p _l yp 


(7.7.24) 


where A(s), Y (s) are polynomials of degree n + q— 1 that satisfy (7.3.25) and A*(s) 
is a Hurwitz polynomial of degree 2(n+q) — 1. Equation (7.7.24) for u p is established 
in the proof of Theorem 7.3.1 and is used here without proof. 


Step 2. Establish e.s. for the homogeneous part of (7.7.23). Let us first 
examine the stability properties of A c (t) in (7.7.23). For each frozen time t, we 
have 

det(sJ - A c ) = det (si -A + BK C ) = A* (s) (7.7.25) 

i.e., A c (t) is stable at each frozen time t. If Z p (s,t)Qi(s), R p (s,t)Qm ( s ) are strongly 
coprime, i.e., (A, B) is strongly controllable at each time t, then the controller gains 
K c may be calculated at each time t using Ackermann’s formula [95], i.e., 

k c = [o,o,...,o,i]G~ 1 A* c (A) 

where 

Q c = [B, Ab, . . . , A n+q ~ 1 B\ 

is the controllability matrix of the pair (A, B). Because ( A,B ) is assumed to be 
strongly controllable and A, B € £00 due to 9 P € £ 00 , we have K c e £00 • Now, 

k c = [0,0, ... ,0, 1] cG?a* c (A) + g^±A* c (A ) J 


Because 9 p £ and 9 p e £ 2 , it follows that ||AT c (t)|| £ £ 2 , which, in turn, implies 
that ||A c (f)|| £ £ 2 . From A c being pointwise stable and j|A c (t)|| £ £ 2 , we have that 
A c (t) is a u.a.s matrix by applying Theorem 3.4.11. If Z p (s, t)Qi(s), R p (s, t)Q m (s) 
are not strongly coprime but Z p (s,t), R p (s,t)Q m (s) are, the boundedness of K c 

and ||A: C (<)|| £ £ 2 can still be established by decomposing (A,B) into the strongly 
controllable and the stable uncontrollable or weakly controllable parts and using 
the results in [95] to obtain an expression for K c . Because A 0 is a stable matrix the 
homogeneous part of (7.7.23) is e.s. 


Step 3. Use the properties of the £ 2,5 norm and the B-G Lemma to establish 
boundedness. As in Example 7.4.2., we apply Lemmas 3.3.3, 3.3.2 to (7.7.23) and 
(7.7.24), respectively, to obtain 

||e|| < c||C T e 0 || 

\\Up\\ < c||C ,T e 0 || + c||e|| + c < c||C T e 0 || + c 

IKII < c\\e\\ + c \\y p \\ < c\\C T e Q \\ + c 


(7.7.26) 
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where || • || denotes the C 26 norm for some 6 > 0. 

We relate the term C ,T e 0 with the estimation error by using (7.7.23) to express 
C T e 0 as 

C T e 0 = C T (sI - A o ) _1 (0 1 e 1 - 0 2 u p ) (7.7.27) 

Noting that (C,A a ) is in the observer canonical form, i.e., C T (sI — A 0 ) _1 = 
ari A q Asj a) > we have 


C T e 0 = ^2 —^(Oue 1 - 0 2 iU p ), n = n + q - 1 


i=0 


where 9i = [0,;i, 9i 2 , ■ ■ . , , * = 1,2. Applying Swapping Lemma A.l to each 

term under the summation, we have 


~ A p (s)Qi(s) 

r"iiei = 


^S(s) 


and 


A P (s)Qi(s) 


* » / \ / \ e i T Wd{ s ) (Wbi(s)ei) 6\i 

A p (s)Q 1 (s) 


A*(s)° 2lUp ~ P A*(s ) V® 2 %(s)Q 1 (s)“ p + WciW ( w bi(s)u P )02i 

where W c { . W b i, i = 0, . . . ,n + q — 1 are transfer matrices defined in Lemma A.l 
with W{s ) = ,\ ( s )Qi (, s ) ■ Therefore, C T e 0 can be expressed as 


c T e 0 = 

A *o («) “ V 

0 (g)Qi 
A* (a) 


-ei — 9 


A P (s)Qi( S y L 2i A p (s)Q 1 (s) 


Up) + r i 


_ A p (s)Qi(s) / AT a ra+g-l( s ) AT a n+ 9 -l(s) \ y 00 \ 

— — / \ Ti / \ e l — ^2 “I — / TT 7 s Up + r l 1 < ■ ' ■*°) 

A p (s)Qi(s) Ap(s)Q 1 (s) 


where 


n = Ap ^ s l( ' S) ]T W ci (s)[(W bl (s)e i) 0 l4 - (Wm^Sp) 0 2i ] 

' i=0 

From the definition of 9 \ , we have 

^1 ^n+g— 1(^) — ^1 *Tn-|-g— 1(^) Qn+q— 1(^) 

= i?p(s, i)Q m (s) - s n+q - R p (s)Qm(s ) + s" +<? 

= (Rp(s,t) - Rp(s))Q m (s) = &J a n -i(s)Q m (s) (7.7.29) 

where 9 a == 9 a — 9* is the parameter error. Similarly, 

9 2 Q^n+g— 1 ( 5 ) 9 b Oi n — 1 (s) Q\ (s) 


(7.7.30) 
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A 


where 9 b = 9 b — 9 b . Using (7.7.29) and (7.7.30) in (7.7.28), we obtain 
A p (s)Qi(s) 


C T e„ = 


At(a) 

A p (s)Qi(s) 


2T t \Qm(s) 1 ~ T , . 1 _ 

6 a “n-1 « 777TT7T e l-^ an-l WTJTUp + ?T 
Q i(s) A p (s) A p (s) 


AS(s) 


9 a a n— 1 (s) 


Qm(s) 

< 3 i(s)Ap(s) 


Vp Ob rr n _i(s) 


Qm(^) 

A p (s)Qi(s) 


+U 


(7.7.31) 


where the second equality is obtained using 

Qm(s)c 1 — Qm[s)ypi (^l(s)'Up Qm(.s)Up 

Noting that 

ttn— i(s)— ——Up = </>i, a n _i(s)— j-rVp = ~ 4 > 2 
Ap(s) Ap(s) 

we use Swapping Lemma A.l to obtain the following equalities: 

Qm (^0 


Qm( S ) nT i 2T. / \ 

n ( \ 9 a 02 — -9 a Q!n-i(s) 

Qi{s) 

Qm(s) £jT 


Qi(s)Ap(s) 

Qrn(^) 


2/p + W C9 (W bq (s)c/>J) 9 a 


Chis) 9 * ^ “ ^ a "- l(s) Q 1 ( S I )Ap( s ) Mp + ~ 9b 
where W cq , W bq are as defined in Swapping Lemma A.l with W(s) = ■ Using 


v cq, ™bq 

the above equalities in (7.7.31) we obtain 

C T e 0 = - 

where 


Ap(s)<2m(s) x i 

“4 ~ e ^ + ^ 


(7.7.32) 


a , Ap(s)Qi(s) 

r2 = r ' + ^*5T 


(iu,(») (iu,(»)«>7) S b + w c ,(s) (wi,( s ) 0 j) e. 


From Table 7.5, the normalized estimation error satisfies the equation 

em 2 = -Ojcj) 

which can be used in (7.7.32) to yield 


-yT„ A p (s)Q m (s) 


C e„ = 


From the definition of ?n 2 = 1 + ||w, p || 2 + \\y p \\ 2 and Lemma 3.3.2 , we can show 
that m f is a normalizing signal in the sense that 0/m/,m/r?z/ £ Coo for some 


A£(s) 


em + r 2 


(7.7.33) 
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5 > 0. From the expression of ri,r 2 and the normalizing properties of to/, we use 
Lemma 3.3.2 in (7.7.33) and obtain 

|| C' T e c> || < c||eTOrn/|| + c||0 p to/|| (7.7.34) 


Using (7.7.34) in (7.7.26) and in the definition of to/, we have the following inequal- 
ity: 

To/ < c||erran/|| 2 + ||0 p to/|| 2 + c 
or 

TO/ < c||gTO/|| 2 +C 

where g 2 = e 2 m 2 + \6 p \ 2 and g £ C 2 , to which we can apply the B-G Lemma to 
show that uif £ Coo ■ From to/ £ Coo and the properties of the C 2 s norm we can 
establish boundedness for the rest of the signals. 

Step 4. Convergence of the tracking error to zero. The convergence of the 
tracking error to zero can be proved using the following arguments: Because all 
signals are bounded, we can establish that 4( em 2 ) £ Coo > which, together with 
an 2 £ £ 2 , implies that e(t)m 2 (t) — > 0, and, therefore, 0 p (t ) — * 0 as t — > 00 . From 
the expressions of ri,r 2 we can conclude, using Corollary 3.3.1, that r 2 £ C 2 and 
r 2 (t) — > 0 as t — > 00 . From (7.7.33), i.e., 


C T e 0 


A p {s)Q m M 2 

AS(s) 


+ r 2 


and £TO 2 ,r 2 £ C 2 and r 2 — ^ > 0 as t — > 00 , it follows from Corollary 3.3.1 that 
|C' T e 0 | £ C 2 and |C' T e 0 (t)| — > 0 as t — > 00 . Because, from (7.7.23), we have 

e = ^4 c (t)e + K 0 C T e 0 


it follows from the u.a.s property of A c (t) and the fact that |C' T e 0 | £ C 2 , |C ,T e 0 | — > 0 
as t — > 00 that e — > 0 as t — > 00 . From ei = y p — y m = C T e 0 + C T e (see (7.7.24)), 
we conclude that ei(t) — > 0 as t — > 00 and the proof is complete. 


7.7.3 Proof of Theorem 7.5.1 

We present the stability proof for the hybrid scheme of Table 7.9. The proof of 
stability for the schemes in Tables 7.8 and 7.10 follows using very similar tools and 
arguments and is omitted. 

First, we establish the properties of the hybrid adaptive law using the following 
lemma. 

Lemma 7.7.1 Let 0 2 k. K ok , K ck be as defined in Table 7.9 and 6 p k as defined 
in Table 7.7. The hybrid adaptive law of Table 7.7 guarantees the following: 



524 


CHAPTER 7. ADAPTIVE POLE PLACEMENT CONTROL 


(i) 0 P k G loo and A 9 pk G l 2 ,sm G C 2 

(ii) 9 \ k , 9 2k , Rok £ loo and A9 kkl A9 2k G l 2 
where Axk = Xk+i — x k ; k = 0, 1, 2, . . . 

(iii) If Rp(s,tk)Qm(s), Z p (s,tk) are strongly coprime for each k = 0,1,..., then 
E-ck ^ loo • 


Proof The proof for (i) is given in Section 4.6. 

By definition, 9\ kl 9 2k are the coefficient vectors of R p {s,t k )Q m (s ) — s n+q , 
Zp(s,t k )Qi(s) respectively, where Rjfs, t k ) = s n + 0j fc a„_i(s), Zjfs, t k ) = 9j,a n ^i(s). 
We can write 

9\ k = F\9 a k i 9 2k = F 2 6 bk 

where F k ,F 2 G n( n +i') xn are constant matrices which depend only on the coef- 
ficients of Qi(s), Q m {s), respectively. Therefore, the properties 6i k ,6 2k G foo and 
A0 lfc , A9 2k G t 2 follow directly from (i). Because K ok = a* — 9\ k , we have K ok G loo 
and the proof of (ii) is complete. 

Part (iii) is a direct result of linear system theory. □ 

Lemma 7.5.1 indicates that the hybrid adaptive law given in Table 7.7 has 
essentially the same properties as its continuous counterpart. We use Lemma 7.7.1 
to prove Theorem 7.5.1 for the adaptive law given in Table 7.9. 

As in the continuous time case, the proof is completed in four steps as follows: 

Step 1. Develop the state error equation for the closed-loop APPC scheme. Be- 
cause the controller is unchanged for the hybrid adaptive scheme, we follow exactly 
the same steps as in the proof of Theorem 7.4.2 to obtain the error equations 

e = A ck (t)e + k 0 k(t)C T e 0 
e a = A 0 e 0 + 9\ k (t)ei — 9 2k u p 

y p = C T e 0 + C r e + y m (7.7.35) 

u p = Wi(s)K ck (t)e + W 2 (s)y p 
u p F c k (t) e 

where A ck ,K ok ,K ck ,9\ k ,9 2k are as defined in Section 7.4.3 with 9 b {t),9 a {t) re- 
placed by their discrete versions 9 bk , 9 ak respectively, i.e., K ok = a* — 9i k and K ck 
is solved from det(sl — A k + B k K ck ) = A*(s); A a is a constant stable matrix, 
A ck = A k — B k K ck and Mq (s).W 2 (s) are proper stable transfer functions. Further- 
more, A ck (t), K ck (t), K a k(t), 9i k (t), 9 2k {t) are piecewise constant functions defined 

as f k (t) = f k ,\/t G [kT s , (, k + 1)T S ). 
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Step 2. Establish e.s. for the homogeneous part of (7.7.35). Consider the 
system 

z = A k z, z(t 0 ) = z 0 (7.7.36) 

where 

A k (t) = \ Ack ^ K ok(t)C T ] [kT ^ (fc + 1)2;) 

U A 0 

Because A6i k ,A92k € Ii and (Ak,B k ) is strongly controllable, one can verify 
that AI\ ok , AI\ ck £ Ii and thus AA ck , AA k £ l^. Therefore, for any given small 
number p > 0, we can find an integer N fl such that 2 

\\A k - A N J\ < h, Vfc > 

We write (7.7.36) as 

z = Ajy^z + {A k — Ayjz, Vf > N^T S (7.7.37) 

Because A^ /: is a constant matrix that is stable and \\A k — An || < p, Vfc > N fl , we 
can fix p to be small enough so that the matrix A k = + ( A k — An ) is stable 

which implies that 

\z(t) | < c 1 e- a ( t - Ar - T ^|z(^T s )|, Vt > N^Ts 

for some constants ci, a > 0. Because 

z(N IJ T s ) = e AN » T °e AN »- lT ° ■ ■ ■ e AlT ° e A ° T * z 0 

and Np is a fixed integer, we have 

\z{N tl T s )\<c 2 e-^ N ^- t A\ Zo \ 

for some constant C 2 > 0. Therefore, 

\z(t)\ < c^e-^-^zol Vt > N^T S (7.7.38) 

where Ci, C 2 , a are independent of to, 2o- 

On the other hand, for t £ [t 0 ,N M T s ), we have 

z(t) = e Mt-i T ,) e Ai-iT s . . . e YT Se A 0 T SZo (7.7.39) 

2 If a sequence {/*,} satisfies A f k £ £2, then linin -,00 y)” ||A/,|| = 0, which implies 

that for every given p > 0, there exists an integer IV f that depends on p such that 

II A/ill < P, Vfc > iV M . Therefore, \\f k - f N J = ||eU m A f'\\~ H A /*H < 

p. 
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where i is the largest integer that satisfies i < jr and i < N p . Because N p is a 
fixed integer, it follows from (7.7.39) that 

W ) I < c 3 \z 0 \ < C 3 e -“(t-* 0 )|^|, vt G [to,N„T a ) (7.7.40) 

for some constant c 3 and c 3 = c 3 e aA V T ' ! ) which is independent of to, Zq. Combining 
(7.7.38) and (7.7.40), we have 

\z(t ) | < ce — * o) |^(t 0 ) U Vt > t 0 

for some constants c, a > 0 which are independent of to,zo and therefore (7.7.36), 
the homogeneous part of (7.7.35), is e.s. 

Step 3. Use the properties of the C 2b norm and B-G Lemma to establish signal 
boundedness. Following exactly the same procedure as in the proof of Theorem 7.4.2 
presented in Section 7.7.2 for Step 3 of the continuous APPC, we can derive 

IIto/H 2 < c + c\\C T e 0 \\ 2 

where || • || denotes the £ 2 8 norm, and 

C T e a = C T (sI - A 0 )- 1 (0 lfc ei - 9 2k u p ) (7.7.41) 

Because A9 ak ,A9 bk G l 2 , according to Lemma 5.3.1, there exist vectors 9 a (t),9 b (t) 
whose elements are continuous functions of time such that \9 ak (t) — 9 a (t)\ G C 2 , 

1 9bk{t) — 9 b (t) | G C 2 and 9 a (f),9 b {t) G C 2 . Let 9\,6 2 be the vectors calculated from 
Table 7.9 by replacing 9 ak , 6b k with 9 a , 9 b respectively. As we have shown in the 
proof of Theorem 7.4.2, 9i,9 2 depend linearly on 9 a ,9 b . Therefore, given the fact 
that 9 a , 9 b , 1 9 a - 9 ak I, 1 9 b - 9 bk \ G £ 2 , we have 9i,9 2 , \9\ - 9 lk |, \9 2 - 9 2k \ G £ 2 . 
Thus, we can write (7.7.41) as 

C T e a = C T (sI - A 0 )-\9 iei - 9 2 u p + e) (7.7.42) 

where e = (9i k (t) — 9i(t))ei — ( 6 2k (t ) — 6 2 (t))u p . Now, 9i,9 2 have exactly the same 
properties as 9i,9 2 in equation (7.7.27). Therefore, we can follow exactly the same 
procedure given in subsection 7.7.2 (from equation (7.7.27) to (7.7.33)) with the 
following minor changes: 

• Replace 9 \ , 9 2 by 9 \ , 9 2 

• Replace 9 ai 9 b by 9 a , 9 b 

• Add an additional term C T (si — A 0 ) -1 e in all equations for C T e„ 

• In the final stage of Step 3, to relate 9j(j> 2 — 9 b </>i with em 2 , we write 

6j $2 ~0b<t> 1 = 6a k (f 2 - 9 bk <f>i + e 2 
where e 2 = (9 a - 9 ak ) T 4> 2 - (9 b - 9 bk ) T i e., 

em 2 = 9j k (j> 2 - 9j k (j) 1 = 9j<f> 2 - 9 J fa - e 2 
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At the end of this step, we derive the inequality 

m 2 f < c\\gm f \\ 2 +c 

where g 2 = e 2 m 2 + \9 p \ 2 + \9 p k — 9 P \ 2 , 9 p = [#J, 9j] T , and g € £ 2 - The rest of the 
proof is then identical to that of Theorem 7.4.2. 

Step 4. Establish tracking error converges to zero. This is completed by fol- 
lowing the same procedure as in the continuous case and is omitted here. □ 


7.8 Problems 

7.1 Consider the regulation problem for the first order plant 

x = ax + bu 
y = x 


(a) Assume a and b are known and b ^ 0. Design a controller using pole 

placement such that the closed- loop pole is located at —5. 

(b) Repeat (a) using LQ control. Determine the value of A in the cost 

function so that the closed-loop system has a pole at —5. 

(c) Repeat (a) when a is known but b is unknown. 

(d) Repeat (a) when b is known but a is unknown. 

7.2 For the LQ control problem, the closed-loop poles can be determined from 

G(s ) = Z p (s)/Rp(s) (the open-loop transfer function) and A > 0 (the weight- 
ing of the plant input in the quadratic cost function) as follows: Define 

F(s) = R p (s)Rp(—s) + A ~ 1 Zp(s)Z p (—s). Because F(s) = F(—s), it can 
be factorized as 

F(s) = (s + Pi)(s + P2) ■ ■ ■ (s + Pn)(-S + Pl)(-S +P2) ■ ■ ■ ( -S + Pn ) 

where p t > 0. Then, the closed- loop poles of the LQ control are equal to 
Pl,P 2 ,---,Pn- 

(a) Using this property, give a procedure for designing an ALQC without 

having to solve a Riccati equation. 

(b) What are the advantages and disadvantages of this procedure compared 
with that of the standard ALQC described in Section 7.4.3? 

7.3 Consider the speed control system given in Problem 6.2, where a , b , d are as- 

sumed unknown. 
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(a) Design an APPC law to achieve the following performance specifications: 

(i) The time constant of the closed-loop system is less than 2 sec. 

(ii) The steady state error is zero for any constant disturbance d 

(b) Design an ALQC law such that 

/»00 

J = / (y 2 + \u 2 )dt 

Jo 

is minimized. Simulate the closed-loop ALQC system with a = 0.02, b = 
1.3, d = 0.5 for different values of A. Comment on your results. 

7.4 Consider the following system: 

= 

y s 2 + 2 £u} n s + to 2 

where the parameter w n (the natural frequency) is known, but the damping 
ratio £ is unknown. The performance specifications for the closed-loop system 
are given in terms of the unit step response as follows: (a) the peak overshoot 
is less than 5% and (b) the settling time is less than 2 sec. 

(a) Design an estimation scheme to estimate £ when ui n is known. 

(b) Design an indirect APPC and analyze the stability properties of the 

closed-loop system. 


7.5 Consider the plant 


V = 


s + b 

rU 

s{s + a) 


(a) Design an adaptive law to generate a and b , the estimate of a and 6, 

respectively, on-line. 

(b) Design an APPC scheme to stabilize the plant and regulate y to zero. 

(c) Discuss the stabilizability condition a and b have to satisfy at each time 

t. 

(d) What additional assumptions you need to impose on the parameters a 

and b so that the adaptive algorithm can be modified to guarantee the 
stabilizability condition? Use these assumptions to propose a modified 
APPC scheme. 


7.6 Repeat Problem 7.3 using a hybrid APPC scheme. 

7.7 Solve the MRAC problem given by Problem 6.10 in Chapter 6 using Remark 

7.3.1 and an APPC scheme. 

7.8 Use Remark 7.3.1 to verify that the MRC law of Section 6.3.2 shown in Figure 

6.1 is a special case of the general PPC law given by equations (7.3.3), (7.3.6). 
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7.9 Establish the stability properties of the hybrid ALQ control scheme given in 

Table 7.10. 

7.10 Establish the stability properties of the hybrid APPC scheme of Table 7.8. 

7.11 For n = 2,q = 1, show that A(t) defined in (7.7.5) satisfies det(sl — A(t)) = 
A* A 9 , where A*, A q are defined in Table 7.4. 



Chapter 8 


Robust Adaptive Laws 


8.1 Introduction 

The adaptive laws and control schemes developed and analyzed in Chapters 4 
to 7 are based on a plant model that is free of noise, disturbances and 
unmodeled dynamics. These schemes are to be implemented on actual plants 
that most likely deviate from the plant models on which their design is based. 
An actual plant may be infinite dimensional, nonlinear and its measured 
input and output may be corrupted by noise and external disturbances. 
The effect of the discrepancies between the plant model and the actual plant 
on the stability and performance of the schemes of Chapters 4 to 7 may 
not be known until these schemes are implemented on the actual plant. In 
this chapter, we take an intermediate step, and examine the stability and 
robustness of the schemes of Chapters 4 to 7 when applied to more complex 
plant models that include a class of uncertainties and external disturbances 
that are likely to be present in the actual plant. 

The question of how well an adaptive scheme of the class developed in 
Chapters 4 to 7 performs in the presence of plant model uncertainties and 
bounded disturbances was raised in the late 1970s. It was shown, using 
simple examples, that an adaptive scheme designed for a disturbance free 
plant model may go unstable in the presence of small disturbances [48]. 
These examples demonstrated that the adaptive schemes of Chapters 4 to 
7 are not robust with respect to external disturbances. This nonrobust 
behavior of adaptive schemes became a controversial issue in the early 1980s 
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when more examples of instabilities were published demonstrating lack of 
robustness in the presence of unmodeled dynamics or bounded disturbances 
[85, 197]. This motivated many researchers to study the mechanisms of 
instabilities and find ways to counteract them. By the mid-1980s, several 
new designs and modifications were proposed and analyzed leading to a body 
of work known as robust adaptive control. 

The purpose of this chapter is to analyze various instability mechanisms 
that may arise when the schemes of Chapters 4 to 7 are applied to plant 
models with uncertainties and propose ways to counteract them. 

We start with Section 8.2 where we characterize various plant model un- 
certainties to be used for testing the stability and robustness of the schemes 
of Chapters 4 to 7. In Section 8.3, we analyze several instability mechanisms 
exhibited by the schemes of Chapters 4 to 7, in the presence of external dis- 
turbances or unmodeled dynamics. The understanding of these instability 
phenomena helps the reader understand the various modifications presented 
in the rest of the chapter. In Section 8.4, we use several techniques to modify 
the adaptive schemes of Section 8.3 and establish robustness with respect to 
bounded disturbances and unmodeled dynamics. The examples presented in 
Sections 8.3 and 8.4 demonstrate that the cause of the nonrobust behavior 
of the adaptive schemes is the adaptive law that makes the closed-loop plant 
nonlinear and time varying. The remaining sections are devoted to the devel- 
opment of adaptive laws that are robust with respect to a wide class of plant 
model uncertainties. We refer to them as robust adaptive laws. These robust 
adaptive laws are combined with control laws to generate robust adaptive 
control schemes in Chapter 9. 

8.2 Plant Uncertainties and Robust Control 

The first task of a control engineer in designing a control system is to obtain 
a mathematical model that describes the actual plant to be controlled. The 
actual plant, however, may be too complex and its dynamics may not be 
completely understood. Developing a mathematical model that describes 
accurately the physical behavior of the plant over an operating range is a 
challenging task. Even if a detailed mathematical model of the plant is 
available, such a model may be of high order leading to a complex controller 
whose implementation may be costly and whose operation may not be well 
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understood. This makes the modeling task even more challenging because 
the mathematical model of the plant is required to describe accurately the 
plant as well as be simple enough from the control design point of view. 
While a simple model leads to a simpler control design, such a design must 
possess a sufficient degree of robustness with respect to the unmodeled plant 
characteristics. To study and improve the robustness properties of control 
designs, we need a characterization of the types of plant uncertainties that 
are likely to be encountered in practice. Once the plant uncertainties are 
characterized in some mathematical form, they can be used to analyze the 
stability and performance properties of controllers designed using simplified 
plant models but applied to plants with uncertainties. 

8.2.1 Unstructured Uncertainties 

Let us start with an example of the frequency response of a stable plant. 
Such a response can be obtained in the form of a Bode diagram by exciting 
the plant with a sinusoidal input at various frequencies and measuring its 
steady state output response. A typical frequency response of an actual 
stable plant with an output y may have the form shown in Figure 8.1. 

It is clear that the data obtained for lo > ui m are unreliable because 
at high frequencies the measurements are corrupted by noise, unmodeled 
high frequency dynamics, etc. For frequencies below L0 m , the data are accu- 
rate enough to be used for approximating the plant by a finite-order model. 
An approximate model for the plant, whose frequency response is shown in 
Figure 8.1, is a second-order transfer function Gq(s) with one stable zero 
and two poles, which disregards the phenomena beyond, say lo > io m . The 
modeling error resulting from inaccuracies in the zero-pole locations and 
high frequency phenomena can be characterized by an upper bound in the 
frequency domain. 

Now let us use the above example to motivate the following relationships 
between the actual transfer function of the plant denoted by G(s ) and the 
transfer function of the nominal or modeled part of the plant denoted by 
G 0 (s). 

Definition 8.2.1 (Additive Perturbations) Suppose that.G(s) andGo(s) 
are related by 


G(s) — Gq(s) + A a (s) 


(8.2.1) 



Figure 8.1 An example of a frequency response of a stable plant. 


where A a (s) is stable. Then A a (s) is called an additive plant perturbation 
or uncertainty. The structure of A a (s) is usually unknown but A a (s) is 

assumed to satisfy an upper bound in the frequency domain, i.e. , 

\A a {ju)\ < 6 a {u) Vu (8.2.2) 

for some known function S a (cv). In view of (8.2.1) and (8.2.2) defines a family 
of plants described by 

n a = {G| \G(ju>) - G 0 (M\ <<5 a (w)} (8.2.3) 

The upper bound 5 a (u) of A a {ju) may be obtained from frequency re- 
sponse experiments. In robust control [231], Gq(s) is known exactly and the 
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uncertainties of the zeros and poles of G(s) are included in A a (s). In adap- 
tive control, the parameters of Go(s) are unknown and therefore zero-pole 
inaccuracies do not have to be included in A a (s). Because the main topic 
of the book is adaptive control, we adopt Definition 8.2.1, which requires 
A a (s) to be stable. 

Definition 8.2.2 (Multiplicative Perturbations) Let G(s), Gq(s) be re- 
lated by 

G(s) = Go(s)(l + A m (s)) (8.2.4) 

where A m (s) is stable. Then A m (s) is called a multiplicative plant pertur- 
bation or uncertainty. 

In the case of multiplicative plant perturbations, A m (s) may be constrained 
to satisfy an upper bound in the frequency domain, i.e., 


A m (ju)\ < S m (oj) 


(8.2.5) 


for some known S m (u>) which may be generated from frequency response 
experiments. Equations (8.2.4) and (8.2.5) describe a family of plants given 
by 


n m=<G 


I Gjjio) - G 0 {jco)\ 

\Go(Ju)\ 


< dm{u) 


( 8 . 2 . 6 ) 


For the same reason as in the additive perturbation case, we adopt Defini- 
tion 8.2.2 which requires A m (s) to be stable instead of the usual definition 
in robust control where A m (s) is allowed to be unstable for a certain family 
of plants. 


Definition 8.2.3 (Stable Factor Perturbations) Let G(s), G'o(s) have 
the following coprime factorizations [231]: 


G{s) 


Np(s) + Ai(s) 
Do {s') + A 2 (s) 


G 0 {s) 


N 0 {s) 

D 0 (s) 


(8.2.7) 


where No and Dq are proper stable rational transfer functions that are co- 
prime, 1 and Ai(s) and A 2 (s) are stable. Then Ai(s) and A 2 (s) are called 
stable factor plant perturbations. 


x Two proper transfer functions P(s), Q(s) are coprime if and only if they have no finite 
common zeros in the closed right half s-plane and at least one of them has relative degree 
zero [231]. 
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A m (s) 


-b 


+ 


G 0 (s) 


(ii) 



(iii) 


Figure 8.2 Block diagram representations of plant models with (i) addi- 
tive, (ii) multiplicative, and (iii) stable factor perturbations. 


Figure 8.2 shows a block diagram representation of the three types of plant 
model uncertainties. 

The perturbations A a (s), A m (s), Ai(s), and A 2 (s) defined above with 
no additional restrictions are usually referred to as unstructured plant model 
uncertainties. 
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8.2.2 Structured Uncertainties: Singular Perturbations 

In many applications, the plant perturbations may have a special form be- 
cause they may originate from variations of physical parameters or arise 
because of a deliberate reduction in the complexity of a higher order math- 
ematical model of the plant. Such perturbations are usually referred to as 
structured plant model perturbations. 

The knowledge of the structure of plant model uncertainties can be ex- 
ploited in many control problems to achieve better performance and obtain 
less conservative results. 

An important class of structured plant model perturbations that describe 
a wide class of plant dynamic uncertainties, such as fast sensor and actuator 
dynamics, is given by singular perturbation models [106]. 

For a SISO, LTI plant, the following singular perturbation model in the 
state space form 

x = A\\x + A\ 2 Z + B\u, x € H n 
pz = A- 2 \x + A- 22 Z + B 2 U, z € 1 Z"‘ (8.2.8) 

y — f ' 1 x A C '2 z 

can be used to describe the slow (or dominant) and fast (or parasitic) phe- 
nomena of the plant. The scalar p represents all the small parameters such 
as small time constants, small masses, etc., to be neglected. In most appli- 
cations, the representation (8.2.8) with a single parameter p can be achieved 
by proper scaling as shown in [106]. All the matrices in (8.2.8) are assumed 
to be constant and independent of p. As explained in [106], this assumption 
is for convenience only and leads to a minor loss of generality. 

The two time scale property of (8.2.8) is evident if we use the change of 
variables 

Zf = z + L(p)x (8.2.9) 

where L(p) is required to satisfy the algebraic equation 

A 21 — A 22 L + pLAn — pLA\2L = 0 (8.2.10) 

to transform (8.2.8) into 

x = A s x + A\ 2 Zf + B\u 

pzj = AfZf + B s u (8.2.11) 

y = Cj x + Cj Zf 
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where A s = An — Ai 2 L, Af = A 22 + /aLAi 2 , B s = B 2 + nLBi , Cj =Cj- C 2 L. 
As shown in [106], if A 22 is nonsingular, then for all /i G [0,/t*) and some 
[A* > 0, a solution of the form 


L — ^22*^21 + 0(/i) 


satisfying (8.2.10) exists. It is clear that for u = 0, i.e., 


x = A s x + A\ 2 Zf 
fiZf = A fZ f 


(8.2.12) 


the eigenvalues of (8.2.12) are equal to those of A s and Af/n, which, for 
small n and for Af nonsingular, are of 0(1) and 0(l/fi), respectively 2 . The 
smaller the value of /j, the wider the distance between the eigenvalues of A s 
and Af/n, and the greater the separation of time scales. It is clear that if 
Af is stable then the smaller the value of fi is, the faster the state variable 
Zf goes to zero. In the limit as n — > 0, Zf converges instantaneously, i.e., 
infinitely fast to zero. Thus for small /j, the effect of stable fast dynamics is 
reduced considerably after a very short time. Therefore, when A 22 is stable 
(which for small // implies that Af is stable), a reasonable approximation of 
(8.2.8) is obtained by setting /a = 0, solving for z from the second equation 
of (8.2.8) and substituting for its value in the first equation of (8.2.8), i.e., 


X 0 = AqXq + BqU, Xq G PC' 

y 0 = Cjx 0 + D 0 u 

where A$ = An — Ai 2 A 2 2 1 A 2 i, Bq = B\ — A\ 2 A 2 2 B 2l Cq = Cj — C 2 A 22 A 2 i 
and Do = —Cj A 22 B 2 . With n set to zero, the dimension of the state space 
of (8.2.8) reduces from n + mton because the differential equation for z in 
(8.2.8) degenerates into the algebraic equation 


0 = A 2 iXq + A 2 2^0 + B 2 u 


i.e., 

zq = —A 22 (A 2 iXo + B 2 u) (8.2.14) 


2 We say that a function f(x) is 0(|x|) in V C lZ n if there exists a finite constant c > 0 
such that |/(x)| < c|x| for all x G V where x = 0 £ V. 
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where the subscript 0 is used to indicate that the variables belong to the 
system with p, = 0. The transfer function 

G 0 (s) = Cj(sI-A 0 )- 1 B 0 + D 0 (8.2.15) 

represents the nominal or slow or dominant part of the plant. 

We should emphasize that even though the transfer function G(s) from u 
to y of the full-order plant given by (8.2.8) is strictly proper, the nominal part 
Go(s) may be biproper because D q = —Cj A^ 2 B 2 may not be equal to zero. 
The situation where the throughput D$ = —Cj A^J £>2 induced by the fast 
dynamics is nonzero is referred to as strongly observable parasitics [106]. As 
discussed in [85, 101], if G'o(-s) is assumed to be equal to Cj (si — Ao)” 1 ^ 
instead of (8.2.15), the control design based on Go(s) and applied to the 
full-order plant with p > 0 may lead to instability. One way to eliminate 
the effect of strongly controllable and strongly observable parasitics is to 
augment (8.2.8) with a low pass filter as follows: We pass y through the 
filter for some fi, fo > 0, i.e., 

if = -fovi + fry (8.2.16) 

and augment (8.2.8) with (8.2.16) to obtain the system of order (n + 1 + m) 

x a = A n x a + A 12 z + Bpu 

/iz = A 2 ix a + A 22 Z + B 2 u (8.2.17) 

y = Cjx a 


where x a = [yf,x T \ T and An, A\ 2 , A 2 \, B\,Ci are appropriately defined. 
The nominal transfer function of (8.2.17) is now 

G 0 (s) = Gj( s I-A 0 r 1 Bo 


which is strictly proper. 

Another convenient representation of (8.2.8) is obtained by using the 
change of variables 

j] = Zf + Aj 1 B s u (8.2.18) 

i.e., the new state y represents the difference between the state Zf and the 
“quasi steady” state response of (8.2.11) due to h / 0 obtained by approxi- 
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mating yzf « 0. Using (8.2.18), we obtain 

x = A s x + Ai 2 r] + B s u 

fir ) = Afq + nA~j l B s u (8.2.19) 

y = Cjx + Cl rj + D s u 

where B s = B\ — A\ 2 A/ 1 B s , D s = — Cj Aj 1 B s , provided u is differentiable. 
Because for |u| = 0(1), the slow component of rj is of 0(h), i.e., at steady 
state \rj\ = 0(h) , the state rj is referred to as the parasitic state. It is clear 
that for |u| = 0(1) the effect of rj on x at steady state is negligible for small 
H whereas for |u| > 0(1/ h), \ij\ is of 0(1) at steady state, and its effect on 
the slow state x may be significant. The effect of u and r/ on x is examined 
in later sections in the context of robustness of adaptive systems. 

8.2.3 Examples of Uncertainty Representations 

We illustrate various types of plant uncertainties by the following examples: 

Example 8.2.1 Consider the following equations describing the dynamics of a DC 
motor 

Jlo = k\i 

T di 

L— = —K 2 U> — Rt + v 
at 

where i, v, R and L are the armature current, voltage, resistance, and inductance, 
respectively; J is the moment of inertia; u is the angular speed; k\i and k 2 u) are 
the torque and back e.m.f., respectively. 

Defining x — u>, z = i we have 

x = b()Z, H% — —a 2 x — ol\z + v 
V = x 


where 6o = k\/J,a 2 = k 2 ,a± = R and h = L, which is in the form of the singular 
perturbation model (8.2.8). The transfer function between the input v and the 
output y is given by 


y(s) 

v(s) 


bo 

Hs 2 + aqs + «o 


G(s,hs) 


where ao = boa 2 . In most DC motors, the inductance L = y is small and can be 
neglected leading to the reduced order or nominal plant transfer function 


(X\ s T cx o 
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Using G 0 (s) as the nominal transfer function, we can express G(s,/j,s) as 

b 0 s 2 


G(s,fis) = G 0 (s) + A a (s,/zs), A a (s,ns) = -/z 


(fxs 2 + a\s + a 0 )(o;is + ao) 


where A a (s,n s ) is strictly proper and stable since /z, a i, ao > 0 or as 

s 2 

G(s, ns) = G 0 (s)( 1 + A m (s, ns)), A m (s, ns) = ~n — 5 — — 

ns z + ai s + «o 

where A m (s,/zs) is proper and stable. 

Let us now use Definition 8.2.3 and express G(s, ns) in the form of stable factor 
perturbations. We write G(s,/zs) as 


G(s,ns ) 


GTaj + A x (s,ns) 

fi^ + Ms^s) 


where 

Ar (s,ns) = - nj — & °w tv , 

\f-lS + + A) 

and A > 0 is an arbitrary constant. 


A 2 (s,ns) = ~n 


OIqS 

(/iS + Qi)(s + A) 


v 


Example 8.2.2 Consider a system with the transfer function 

G(*) = e-™ ^ 

s z 

where r > 0 is a small constant. As a first approximation, we can set r = 0 and 
obtain the reduced order or nominal plant transfer function 

Go (s) = ~2 
s z 


leading to 

with A m (s) = e~ TS 


G(s) — Gq(s)(1 + A m (s)) 


1. Using Definition 8.2.3, we can express G(s) as 


_ Nq(s) + Ai(s) 
Dq(s) + A 2 (s) 


where N 0 (s) = (s+ \ )2 , D 0 (s) = (s * A)2 , Ai(s) = e (s " A)2 , and A 2 (s) = 0 where A > 0 
is an arbitrary constant. 

It is clear that for small r, A m (s), Ai(s) are approximately equal to zero at low 
frequencies. V 
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Figure 8.3 General feedback system. 

8.2.4 Robust Control 

The ultimate goal of any control design is to meet the performance require- 
ments when implemented on the actual plant. In order to meet such a goal, 
the controller has to be designed to be insensitive, i.e., robust with respect to 
the class of plant uncertainties that are likely to be encountered in real life. 
In other words, the robust controller should guarantee closed-loop stability 
and acceptable performance not only for the nominal plant model but also 
for a family of plants, which, most likely, include the actual plant. 

Let us consider the feedback system of Figure 8.3 where C, F are designed 
to stabilize the nominal part of the plant model whose transfer function is 
Gq(s). The transfer function of the actual plant is G(s ) and d u , d, d n , y rn are 
external bounded inputs as explained in Section 3.6. The difference between 
G(s) and Gq(s) is the plant uncertainty that can be any one of the forms 
described in the previous section. Thus G(s) may represent a family of 
plants with the same nominal transfer function Go(s) and plant uncertainty 
characterized by some upper bound in the frequency domain. We say that 
the controller (C, F ) is robust with respect to the plant uncertainties in G(s) 
if, in addition to Go(s), it also stabilizes G(s). The property of C, F to 
stabilize G(s) is referred to as robust stability. 

The following theorem defines the class of plant uncertainties for which 
the controller C , F guarantees robust stability. 

Theorem 8.2.1 Let us consider the feedback system of Figure 8.3 where 


(i) G{s) — Gq(s) + A a (s) 

(ii) G(s) = G'o(s)(l + A m (s)) 
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(in) G(s) 


Np(s) + Ai(s) _ Np(s) 

D 0 (s) + A 2 (s )’ 01 j D 0 (s ) 


where A a (s), A m (s), Ai(s), A 2 (s), Nq(s), and Do(s) are as defined in Section 
8.2.1 and assume that C,F are designed to internally stabilize the feedback 
system when G(s) = Gq(s). Then the feedback system with G(s) given by 
(i), (ii), (Hi) is internally stable provided conditions 


(i) 


C(s)F(s) 

l + C(s)F(s)G 0 (s) 


5 a (u)) < 1 


(ii) 


C(s)F(s)Gp(s) 
l + C(s)F(s)G 0 (s) 


dm(^) V 1 

oo 


A 2 (s) + C(s)F(s)A 1 (s) 
D 0 (s) + C(s)F(s)N 0 (s) 


are satisfied for all u, respectively. 


( 8 . 2 . 20 ) 

(8.2.21) 

(8.2.22) 


Proof As in Sections 3.6.1 and 3.6.2, the feedback system is internally stable if 
and only if each element of the transfer matrix 



1 

~FG 

-F 

—F 

1 

C 

1 

-FC 

-FC 

1 + FCG 

CG 

G 

1 

-FCG 


CG 

G 

1 

1 


has stable poles. Because C, F are designed such that each element of H(s ) with 
G = G 0 has stable poles and A a , A m , A x , and A 2 are assumed to have stable poles, 
the only instability which may arise in the feedback system is from any unstable 
root of 

1 + F(s)C(s)G(s) = 0 (8.2.23) 

(i) Let us consider G = Go + A a , then (8.2.23) can be written as 


1 + 


FCA a 
1 + FCG 0 


= 0 


(8.2.24) 


Because 1 + FCG a 1 ^ iave stable poles, it follows from the Nyquist criterion that 
the roots of (8.2.24) are in the open left-half s-plane if the Nyquist plot of 1 +fcq 0 
does not encircle the (— 1, jO) point in the complex s = cr+juj plane. This condition 
is satisfied if 

C{ju)F(jw)5 a (uj) 

1 + C(jw)F(juj)Go(jw) 


< 1 , Vc o G 7Z 
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The above condition implies that the Nyquist plot of l + ( j?cQ a does not encircle the 
(— 1, jO) point for any A a (s) that satisfies |A a (jw)| < 5 a (u>). Hence, the feedback 
system is internally stable for any G(s) in the family 

n a = {G| \G(ju) — G 0 (ja;)| < S a (u) } 


(ii) If G = G 0 (l + A m ) equation (8.2.23) may be written as 


1 + 


FCG 0 
1 + FCG 0 


A, 


= 0 


and the proof follows as in (i). 


(iii) If G = , equation (8.2.23) may be written as 


1 + 


1 

(Do + FCN 0 ) 


[A 2 + FC A x ] = 0 


Because Do+ p CNg , d 0 +fcn 0 > A 2 , Ai have stable poles, the result follows by apply- 
ing the Nyquist criterion as in (i). □ 


We should emphasize that conditions (8.2.20) to (8.2.22) are not only 
sufficient for stability but also necessary in the sense that if they are violated, 
then within the family of plants considered, there exists a plant G for which 
the feedback system with compensators F, C is unstable. 

Conditions (8.2.20) to (8.2.22) are referred to as conditions for robust 
stability. They may be used to choose F. C such that in addition to achieving 
internal stability for the nominal plant, they also guarantee robust stability 
with respect to a class of plant uncertainties. 

As in Section 3.6.3, let us consider the performance of the feedback sys- 
tem of Figure 8.3 with respect to the external inputs in the presence of 
dynamic plant uncertainties. We concentrate on the case where F(s) = 1 
and d n = d u = 0 and the plant transfer function is given by 

G(s) = Go(s)(l + A m (s)) (8.2.25) 

By performance, we mean that the plant output y is as close to the reference 
input y rn as possible for all plants G in the family II m despite the presence 
of the external inputs y m ,d. From Figure 8.3, we have 

CG 1 

V ~ 1 + CG Vm + 1 + CG d 
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or 

e = y m -y= 1 + CG (ym - d) (8.2.26) 

Because 1 + CG = 1 + CGq + CGoA m . (8.2.26) can be expressed as 


e = 


So 


1 + Tq A, 


(ym - d ) 


(8.2.27) 


where So = 1+ ^ G() , To = are the sensitivity and complementary sen- 

sitivity functions for the nominal plant, respectively. 

For robust stability we require 


To(s)||oo(5 m (w) 


C(s)G 0 (s) 

1 + C(s)Go(s) 


5 m {u) < 1 

OO 


which suggests that the loop gain Lq = CGq should be much less than 1 
whenever <5 m (tu) exceeds 1. For good tracking performance, i.e., for small 
error e, however, (8.2.27) implies that So = 1 + cg 0 s ^ ou ^ be small, which, in 
turn, implies that the loop gain Lq should be large. This is the classical trade- 
off between nominal performance and robust stability that is well known 
in the area of robust control. A good compromise may be found when 
5 m {ui) < 1 at low frequencies and v = y rn — d is small at high frequencies. 
In this case the loop gain Lq = C Go can be shaped, through the choice of 
C, to be large at low frequencies and small at high frequencies. 

The trade-off design of the compensator C to achieve robust stability and 
performance can be formulated as an optimal control problem where the C 2 
norm of e, i.e., 



is minimized with respect to C for the “worst” plant within the family II m 
and the “worst” input v within the family of the inputs considered. This 
“min-max” approach is used in the so-called H -2 and optimal control 
designs that are quite popular in the literature of robust control [45, 152], 


8.3 Instability Phenomena in Adaptive Systems 

In Chapters 4 to 7, we developed a wide class of adaptive schemes that meet 
certain control objectives successfully for a wide class of LTI plants whose 



8.3. INSTABILITY PHENOMENA IN ADAPTIVE SYSTEMS 


545 


parameters are unknown. Some of the basic assumptions made in designing 
and analyzing these schemes are the following: 

• The plant is free of noise disturbances, unmodeled dynamics and un- 
known nonlinearities. 

• The unknown parameters are constant for all time. 

Because in applications most of these assumptions will be violated, it is of 
interest from the practical point of view to examine the stability proper- 
ties of these schemes when applied to the actual plant with disturbances, 
unmodeled dynamics, etc. 

In this section we show that some of the simple adaptive schemes ana- 
lyzed in the previous chapters can be driven unstable by bounded distur- 
bances and unmodeled dynamics. We study several instability mechanisms 
whose understanding helps find ways to counteract them by redesigning the 
adaptive schemes of the previous chapters. 

8.3.1 Parameter Drift 

Let us consider the same plant as in the example of Section 4.2.1, where the 
plant output is now corrupted by some unknown bounded disturbance d(t), 
i.e., 

y = 9*u + d 


The adaptive law for estimating 9* derived in Section 4.2.1 for d(t) = 0 
Vf > 0 is given by 

9 = 7 d u, ei = y — 9u (8.3.1) 

where 7 > 0 and 9{t) is the on-line estimate of 9*. We have shown that for 
d(t) = 0 and u, u E (8.3.1) guarantees that 9,e 1 E Too and e\ (t) — > 0 as 
t — ■> 00 . Let us now analyze (8.3.1) when d(t) 7 ^ 0. Defining 9 = 9 — 9*, we 
have 

ei = — 9u + d 


9 = —7 u 2 9 + 'ydu 


and 


(8.3.2) 
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We analyze (8.3.2) by considering the function 

V(9) = f (8.3.3) 

2 7 

Along the trajectory of (8.3.2), we have 

V = — 0 2 u 2 + ddu = —~y \^ u ~ ~2 (8.3.4) 

For the class of inputs considered, i.e. , u € Coo we cannot conclude that 
9 is bounded from considerations of (8.3.3), (8.3.4), i.e., we cannot find a 
constant Vo > 0 such that for V > Vo, V < 0. In fact, for 0* = 2,7 = 1, 
u = (1 + t)~ 2 6 Coo and 

d(t) = (1 + t)~* — 2(1 + t)~^j — > 0 as t — > 00 

we have 

5 1 

y(t ) = -(1 + — > 0 as t —* 00 

€\(t) = -(1 + t )~ 2 — > 0 as t 00 

and 

0(f) = (1 + t)2 — > 00 as t — > 00 

i.e., the estimated parameter 0(f) drifts to infinity with time. We refer to 
this instability phenomenon as parameter drift. 

It can be easily verified that for u(t) = (1 + 1)~2 the equilibrium 9 e = 0 
of the homogeneous part of (8.3.2) is u.s. and a.s., but not u.a.s. Therefore, 
it is not surprising that we are able to find a bounded input 'ydu that leads 
to an unbounded solution 9(t) = 9(t) — 2. If, however, we restrict u(t) to be 
PE with level ao > 0, say, by choosing u 2 = ao, then it can be shown (see 
Problem 8.4) that the equilibrium 9 e = 0 of the homogeneous part of (8.3.2) 
is u.a.s. (also e.s.) and that 

lim sup|0(r)| < — , do = sup \d(t)u(t)\ 

00 T >t 01 0 t 

i.e., 9(t) converges exponentially to the residual set 

D e = {~9 | 0 |<-} 

l ao J 



8.3. INSTABILITY PHENOMENA IN ADAPTIVE SYSTEMS 


547 


indicating that the parameter error at steady state is of the order of the 
disturbance level. 

Unfortunately, we cannot always choose u to be PE especially in the case 
of adaptive control where u is no longer an external signal but is generated 
from feedback. 

Another case of parameter drift can be demonstrated by applying the 
adaptive control scheme 

u = —kx, k = 7 a ; 2 (8.3.5) 

developed in Section 6.2.1 for the ideal plant x = ax + u to the plant 

x = ax + u + d (8.3.6) 

where d(t) is an unknown bounded input disturbance. It can be verified that 
for k(0) = 5, x'(0) = 1, a = 1,7 = 1 and 

d(t ) = (1 + t)~ 5 (5 — (1 + t )~ s — 0.4(1 + t)~ 5 ^ — > 0 as t — > 00 
the solution of (8.3.5), (8.3.6) is given by 

x(t) = (1 + t )~ s — >■ 0 as t — > 00 


and 

k(t ) = 5(1 + t) s — >• 00 as t — ► 00 

We should note that since k( 0) = 5 > 1 and k(t) > k(0) Vt > 0 parameter 
drift can be stopped at any time t by switching off adaptation, i.e. , setting 
7 = 0 in (8.3.5), and still have x, u G and x(t) — > 0 as t — ► 00 . For 
example, if 7 = 0 for t > t\ > 0, then k(t) = k Vt > t\, where k > 5 is a 
stabilizing gain, which guarantees that u, x € and x(t) — > 0 as t — > 00 . 

One explanation of the parameter drift phenomenon may be obtained by 
solving for the “quasi” steady state response of (8.3.6) with u = — kx , i.e., 

d 


Clearly, for a given a and d, the only way for x s to go to zero is for k —> 00 . 
That is, in an effort to eliminate the effect of the input disturbance d and send 
x to zero, the adaptive control scheme creates a high gain feedback. This 
high gain feedback may lead to unbounded plant states when in addition to 
bounded disturbances, there are dynamic plant uncertainties, as explained 
next. 
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8.3.2 High-Gain Instability 


Consider the following plant: 

y = i - ts = i 

u (s — a)(l + ys) s — a 

where y is a small positive number which may be due to a small time constant 
in the plant. 

A reasonable approximation of the second-order plant may be obtained 
by approximating y with zero leading to the first-order plant model 

^ (8.3.8) 

u s — a 


1 - 


2 ns 


1 + ys\ 


(8.3.7) 


where y denotes the output y when y = 0. The plant equation (8.3.7) is in 
the form of y = G 0 (l + A m )u where G 0 (s) = g ] _ a and A m (s) = - ' 2 ^ s . It 
may be also expressed in the singular perturbation state-space form discussed 
in Section 8.2, i.e. , 

x = ax + z — u 

Hz = — z + 2 u (8.3.9) 

y = x 


Setting h 
i.e., 


0 in (8.3.9) we obtain the state-space representation of (8.3.8), 


x = ax + u 
y = x 


(8.3.10) 


where x denotes the state x when y = 0. Let us now design an adaptive 
controller for the simplified plant (8.3.10) and use it to control the actual 
second order plant where y > 0. As we have shown in Section 6.2.1, the 
adaptive law 

u = —kx, k = yeix, e\ = x 


can stabilize (8.3.10) and regulate y = x to zero. Replacing y with the actual 
output of the plant y = x. we have 


u = — kx , 


k = ' ye\x , 


ei = x 


(8.3.11) 


which, when applied to (8.3.9), gives us the closed- loop plant 


x = (a + k)x + z 
yz = —z — 2 kx 


(8.3.12) 
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whose equilibrium x e = 0, z e = 0 with k = k* is a.s. if and only if the 
eigenvalues of (8.3.12) with k = k* are in the left-half s-plane, which implies 
that 

1 

a > k > a 

Because k > 0, it is clear that 

k( 0) > a =7- k(t) > a, Vf > 0 

/i fj. 

that is, from such a k(0 ) the equilibrium of (8.3.12) cannot be reached. 
Moreover, with k > - — a the linear feedback loop is unstable even when the 
adaptive loop is disconnected, i.e. , 7 = 0. 

We refer to this form of instability as high-gain instability. The adaptive 
control law (8.3.11) can generate a high gain feedback which excites the 
unmodeled dynamics and leads to instability and unbounded solutions. This 
type of instability is well known in robust control with no adaptation [106] 
and can be avoided by keeping the controller gains small (leading to a small 
loop gain) so that the closed-loop bandwidth is away from the frequency 
range where the unmodeled dynamics are dominant. 

8.3.3 Instability Resulting from Fast Adaptation 

Let us consider the second-order plant 

x = —x + bz — u 

y,z=—z + 2u (8.3.13) 

y = x 

where b > 1/2 is an unknown constant and fi > 0 is a small number. For 
u = 0, the equilibrium x e = 0, z e = 0 is e.s. for all /i > 0. The objective 
here, however, is not regulation but tracking. That is, the output y = x is 
required to track the output x m of the reference model 

Xm = -Xm + r (8.3.14) 

where r is a bounded piecewise continuous reference input signal. For /i = 0, 
the reduced-order plant is 

x = — x + (26 — l)w 
y = x 


(8.3.15) 
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which has a pole at s = —1, identical to that of the reference model, and an 
unknown gain 2b — 1 > 0. The adaptive control law 

u = Ir, l = —761 r, ei = x — x rn (8.3.16) 

guarantees that x(t ), l(t) are bounded and \x(t) — x m (t)\ — > 0 as t — > 00 for 
any bounded reference input r. If we now apply (8.3.16) with x replaced by 
x to the actual plant (8.3.13) with fj, > 0, the closed-loop plant is 


x = —x + bz — Ir 

[iz = —z + 2lr (8.3.17) 

1 = - 7 r(x - x m ) 

When 7 = 0 that is when l is fixed and finite, the signals x(t) and z(t) are 
bounded. Thus, no instability can occur in (8.3.17) for 7 = 0. 

Let us assume that r = constant. Then, (8.3.17) is an LTI system of the 
form 

Y = AY + B'yrXm (8.3.18) 

where Y = [x, z, l] T , B = [0, 0, 1] T , 

— 1 b —r 

A= 0 —1/m 2?’/m 

—7 r 0 0 

and 7 rx m is treated as a bounded input. The characteristic equation of 
(8.3.18) is then given by 

det(sl — A) = -s 3 + (1 + — )s 2 + (— — 7 r 2 )s + —r 2 (2b — 1) (8.3.19) 

MM M 

Using the Routh-Hurwitz criterion, we have that for 


(1 + m) 
(26 + m) 


(8.3.20) 


two of the roots of (8.3.19) are in the open right-half s-plane. Hence given any 
M > 0, if 7, r satisfy (8.3.20), the solutions of (8.3.18) are unbounded in the 
sense that |T(t)| — > 00 as t — > 00 for almost all initial conditions. Large yr 2 
increases the speed of adaptation, i.e. , l, which in turn excites the unmodeled 
dynamics and leads to instability. The effect of l on the unmodeled dynamics 
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can be seen more clearly by defining a new state variable rj = z — 2 Ir, called 
the parasitic state [85, 106], to rewrite (8.3.17) as 

x = —x + (26 — T)lr + brj 

gf] = — r) + 2 / u 7 ?’ 2 (.t — x m ) — 2 gif (8.3.21) 

1 = - 7 r(cc - x m ) 

where for r = constant, r = 0. Clearly, for a given g, large 7 r 2 may lead to 
a fast adaptation and large parasitic state r] which acts as a disturbance in 
the dominant part of the plant leading to false adaptation and unbounded 
solutions. For stability and bounded solutions, 7 ?’ 2 should be kept small, i.e. , 
the speed of adaptation should be slow relative to the speed of the parasitics 
characterized by -. 


8.3.4 High-Frequency Instability 

Let us now assume that in the adaptive control scheme described by (8.3.21), 
both 7 and |r| are kept small, i.e., adaptation is slow. We consider the case 
where r = rosin cut and ro is small but the frequency cu can be large. At 
lower to middle frequencies and for small 7 r 2 , g, we can approximate gq ~ 0 
and solve (8.3.21) for q, i.e., q ~ 2 /i 7 r 2 (x — x m ) — 2]ilr, which we substitute 
into the first equation in (8.3.21) to obtain 

x = -(1 - 2/t6 7 ? ,2 )x + (gr - 2bgr)l - 2gh^r 1 x m , Q 0 

1 = -M(x - x m ) (8 - 3 ‘ 22) 


where g = 26—1. Because 7 r is small we can approximate l(t) with a 
constant and solve for the sinusoidal steady-state response of x from the 
first equation of (8.3.22) where the small /U 7 r 2 terms are neglected, i.e., 


Xss 


r o 

1 +L0 2 


[(g — 2b/j,uj 2 ) sin cut 


u){g + 2bg)coscot]l 


Now substituting for x = x ss in the second equation of (8.3.22) we obtain 



Mo si n ut 
1 + cu 2 


(, g — 2b/Mjj 2 ) sin cut — c o[g + 26/i)coscut l s + jx m ro sin cut 


i.e. 


l s = a(t)l s + jx m r 0 sin cut 


(8.3.23) 
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A 'yr'f.smujt r « 

a(i) = — ( g — 2bg,uj ) sinwf — to{g + 2o/i)coscut 


v ' 1 + co 2 W ^ w J 

where the subscript ‘s’ indicates that (8.3.23) is approximately valid for 
slow adaptation, that is, for 7 rg(l + w 2 ) -1 / 2 sufficiently small. The slow 
adaptation is unstable if the integral of the periodic coefficient a(t) over the 
period is positive, that is, when /. iuj 2 > g/(2b), or to 2 > g/(26/j) [3]. 

The above approximate instability analysis indicates that if the reference 
input signal r(t) has frequencies in the parasitic range, (i.e., of the order of f 
and higher), these frequencies may excite the unmodeled dynamics, cause the 
signal-to-noise ratio to be small and, therefore, lead to the wrong adjustment 
of the parameters and eventually to instability. We should note again that 
by setting 7 = 0, i.e., switching adaptation off, instability ceases. 

A more detailed analysis of slow adaptation and high-frequency instabil- 
ity using averaging is given in [3, 201]. 


8.3.5 Effect of Parameter Variations 

Adaptive control was originally motivated as a technique for compensating 
large variations in the plant parameters with respect to time. It is, therefore, 
important to examine whether the adaptive schemes developed in Chapters 
4 to 7 can meet this challenge. 

We consider the linear time varying scalar plant 


x = a(t)x + b{t)u 


(8.3.24) 


where a(t) and b(t ) are time varying parameters that are bounded and have 
bounded first order derivatives for all t and b(t ) > bo > 0. We further assume 
that |a(t)|, \b(t)\ are small, i.e., the plant parameters vary slowly with time. 
The objective is to find u such that x tracks the output of the LTI reference 
model 

x m = -x m + r 

for all reference input signals r that are bounded and piecewise continuous. 
If a(f), b(t) were known for all t > 0, then the following control law 


u = —k*(t)x + l*(t)r 


where 


k*(t) = 


1 + a(t) 


m = 
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would achieve the control objective exactly. Since a(t) and b(t) are unknown, 
we try 

u = —k(t)x + l(t)r 

together with an adaptive law for adjusting k(t) and l(t). Let us use the same 
adaptive law as the one that we would use if a(t) and b(t) were constants, 
i.e., following the approach of Section 6.2.2, we have 

k = 7 ieiz, 1 = — 7 2 e 1 r, 71,72 >0 (8.3.25) 

where e\ = e = x — x m satisfies 

€1 = — ei + b(t)(— kx + Ir) (8.3.26) 

where k(t) = k(t ) — k*(t),l(t ) = l(t) — Because k*(t),l*(t ) are time 

varying, k = k — k*, 1 = 1 — l* and therefore (8.3.25), rewritten in terms of 
the parameter error, becomes 

k = 7 ieix — k * , l = — 72 £ir — /* (8.3.27) 


Hence, the effect of the time variations is the appearance of the disturbance 
terms k*,l*, which are bounded because d(t), b(t) are assumed to be bounded. 
Let us now choose the same Lyapunov-like function as in the LTI case for 
analyzing the stability properties of (8.3.26), (8.3.27), i.e., 

V(ei ,k,l) = y + + (8.3.28) 

where b(t) > > 0. Then along the trajectory of (8.3.26), (8.3.27) we have 


V 


—ef — e\ bkx + e\ blr + b h bke \x — 

271 

; l 2 uT Ji* 

+0 ble\r — b — 

272 72 

2 \7i I2) \ 71 72 J 


bkk* 

7i 


(8.3.29) 


From (8.3.28) and (8.3.29) we can say nothing about the boundedness of 
ei,k,l unless k*,l* are either zero or decaying to zero exponentially fast. 
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Even if k*(t), T(t) are sufficiently small, i.e. , a(t), b(t ) vary sufficiently slowly 
with time, the boundedness of ei ,k,l can not be assured from the properties 
of V, V given by (8.3.28), (8.3.29) no matter how small a, b are. 

It has been established in [9] by using a different Lyapunov-like function 
and additional arguments that the above adaptive control scheme guarantees 
signal boundedness for any bounded a(t), b(t ) with arbitrary but finite speed 
of variation. This approach, however, has been restricted to a special class 
of plants and has not been extended to the general case. Furthermore, the 
analysis does not address the issue of tracking error performance that is 
affected by the parameter variations. 

The above example demonstrates that the empirical observation that an 
adaptive controller designed for an LTI plant will also perform well when the 
plant parameters are slowly varying with time, held for a number of years in 
the area of adaptive control as a justification for dealing with LTI plants, may 
not be valid. In Chapter 9, we briefly address the adaptive control problem 
of linear time varying plants and show how the adaptive and control laws 
designed for LTI plants can be modified to meet the control objective in the 
case of linear plants with time varying parameters. For more details on the 
design and analysis of adaptive controllers for linear time varying plants, the 
reader is referred to [226]. 


8.4 Modifications for Robustness: Simple Exam- 
ples 

The instability examples presented in Section 8.3 demonstrate that the adap- 
tive schemes designed in Chapters 4 to 7 for ideal plants, i.e., plants with no 
modeling errors may easily go unstable in the presence of disturbances or un- 
modeled dynamics. The lack of robustness is primarily due to the adaptive 
law which is nonlinear in general and therefore more susceptible to modeling 
error effects. 

The lack of robustness of adaptive schemes in the presence of bounded 
disturbances was demonstrated as early as 1979 [48] and became a hot issue 
in the early 1980s when several adaptive control examples are used to show 
instability in the presence of unmodeled dynamics and bounded disturbances 
[86, 197]. It was clear from these examples that new approaches and adaptive 
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laws were needed to assure boundedness of all signals in the presence of plant 
uncertainties. These activities led to a new body of work referred to as robust 
adaptive control. 

The purpose of this section is to introduce and analyze some of the 
techniques used in the 1980s to modify the adaptive laws designed for ideal 
plants so that they can retain their stability properties in the presence of 
“reasonable” classes of modeling errors. 

We start with the simple plant 


y = 9*u + y (8.4.1) 

where y is the unknown modeling error term and y, u are the plant output 
and input that are available for measurement. Our objective is to design an 
adaptive law for estimating 9* that is robust with respect to the modeling 
error term y. By robust we mean the properties of the adaptive law for y ^ 0 
are “close” ( within the order of the modeling error) to those with ij = 0. 
Let us start with the adaptive law 

6 = jei u, c\ = y — 9u (8.4.2) 

developed in Section 4.2.1 for the plant (8.4.1) with y = 0 and u £ £ 0 Q , and 
shown to guarantee the following two properties: 

(i) €1,6,6 £ Too, (ii) €1,6 e £.2 (8.4.3) 

If in addition u £ , the adaptive law also guarantees that e \ , 9 — > 0 as 

t — > oo. When y = 0 but u 0 instead of (8.4.2) we use the normalized 
adaptive law 

6 = 7 €u, e = - — (8.4.4) 
m z 

where m is the normalizing signal that has the property of — £ C^. A 
typical choice for m in this case is m 2 = 1 + u 2 . The normalized adaptive 
law (8.4.4) guarantees that 

(i) €,€m,9,6£C oo, (ii) e, em, 6 £ £2 (8.4.5) 

We refer to properties (i) and (ii) given by (8.4.3) or (8.4.5) and established 
for y = 0 as the ideal properties of the adaptive laws. When 7j 7^ 0, the 
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adaptive law (8.4.2) or (8.4.4) can no longer guarantee properties (i) and 
(ii) given by (8.4.3) or (8.4.5). As we have shown in Section 8.3.1, we can 
easily find a bounded modeling error term r/, such as a bounded output 
disturbance, that can cause the parameter estimate 9(t) to drift to infinity. 

In the following sections, we introduce and analyze several techniques 
that can be used to modify the adaptive laws in (8.4.2) and (8.4.4). The 
objective of these modifications is to guarantee that the properties of the 
modified adaptive laws are as close as possible to the ideal properties (i) and 
(ii) given by (8.4.3) and (8.4.5) despite the presence of the modeling error 
term ij ^ 0. 

We first consider the case where r\ is due to a bounded output disturbance 
and u G Too- We then extend the results to the case where u G Too and 
rj is due to a dynamic plant uncertainty in addition to a bounded output 
disturbance. 

8.4.1 Leakage 

The idea behind leakage is to modify the adaptive law so that the time deriva- 
tive of the Lyapunov function used to analyze the adaptive scheme becomes 
negative in the space of the parameter estimates when these parameters ex- 
ceed certain bounds. We demonstrate the use of leakage by considering the 
adaptive law (8.4.2) 

6 = 761 u, €\ — y — 6u 

with u G Too for the plant (8.4.1), which we rewrite in terms of the parameter 
error 6 = 9 — 9* as 

9 = 7 d u, ei = — 9u + 77 (8.4.6) 

The modeling error term 7, which in this case is assumed to be bounded, 
affects the estimation error which in turn affects the evolution of 6(t). Let 
us consider the Lyapunov function 

V(9) = I (8.4.7) 

27 

that is used to establish properties (i) and (ii), given by (8.4.3), when r] = 0. 
The time derivative of V along the solution of (8.4.6) is given by 

V = 9eiu = -el + ei rj < -|ei|(|ei| - do) 


(8.4.8) 



8.4. MODIFICATIONS FOR ROBUSTNESS: SIMPLE EXAMPLES 557 


where do > 0 is an upper bound for the error term r/. From (8.4.7) and 
(8.4.8), we cannot conclude anything about the boundedness of 6 because 
the situation where |ei| < do, i.e. , V > 0 and 6{t) — > oo as t — > oo cannot be 
excluded by the properties of V and V for all input signals u £ . 

One way to avoid this situation and establish boundedness in the presence 
of r] is to modify the adaptive law as 

6 = 761 u — 'ywO, e\ = y — Qu (8.4.9) 

where the term wO with w > 0 converts the “pure” integral action of the 
adaptive law given by (8.4.6) to a “leaky” integration and is therefore re- 
ferred to as the leakage modification. The design variable w(t) > 0 is to 
be chosen so that for V > Vo > 0 and some Vo, which may depend on 
do, the time derivative V < 0. Such a property of V, V will allow us to 
apply Theorem 3.4.3 and establish boundedness for V and, therefore, for 
6. The stability properties of the adaptive law (8.4.9) are described by the 
differential equation 

9 = ■yeiu — 'ywO, e± = —9u + r] (8.4.10) 

Let us analyze (8.4.10) for various choices of the leakage term w(t). 

(a) a -modification [85, 86]. The simplest choice for w(t) is 

w(t) = a > 0, Vf > 0 

where a is a small constant and is referred to as the fixed a -modification. 
The adaptive law becomes 

6 = 7d u — qu 0 

and in terms of the parameter error 

6 = 'ye\u — 7<70, e = —9u + r] (8.4.11) 

The time derivative of V = along any trajectory of (8.4.11) is given by 
V = —ef + ei?7 — (i69 < —el + |ei|do — a99 


(8.4.12) 
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Using completion of squares, we write 


-ej + | Cl |do < -f - |[ci - do] 2 + f < - f + f 


—0-00 = — <7 9(9 + 9*) < —a9 2 + cr 


CT 0 2 £7 0 * 2 

< h — — !- 

“22 


(8.4.13) 


Therefore, 


- 2 2 2 2 


Adding and subtracting the term aF for some a > 0, we obtain 


V < —aV - 4 - a - 


If we choose 0 < a < cry, we have 


0 2 dl a\9* 


TJ 2 


+ + 


r<-«r + f + ^ 


(8.4.14) 


which implies that for V > Vo = ^(do + ° |d*| 2 ),U < 0. Therefore, using 
Theorem 3.4.3, we have that 6, 6 £ /loo which, together with u £ /loo, imply 
that ei,0 £ /loo. Hence, with the cr-modification, we managed to extend 
property (i) given by (8.4.3) for the ideal case to the case where 77 / 0 
provided 77 £ /loo- In addition, we can establish by integrating (8.4.14) that 

t nat+w — ® ^ „—at@ (0) 1 1 ( j 2 , i/n*i2\ 


U(0(t)) = — < e 
27 


^+^+<rin*) 


which implies that 9 converges exponentially to the residual set 

D a = \d £ PL 9 2 < ^-(d 2 0 + a\9*\ 2 ) | 

Let us now examine whether we can extend the ideal property (ii) e±, 
9 £ £2 given by (8.4.3) for the case of 77 = 0 to the case where 77 / 0 but 
77 £ /loo- 

We consider the following expression for V : 


ef 9 2 d 2 
v < — 4 — £ 7 — + S + a'~ 


(8.4.15) 
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obtained by using the inequality 

a 2 i/i* 1 2 

— <700 < — <7 ~z — b <7 — — ■ — 
2 2 

instead of (8.4.13). Integrating (8.4.15) we obtain 


/*t+T rt-\-± 

/ efdr+ / u0 2 dr < (rf§ + cr| | 2 )fT +2[1^ (t) — (i+iF)] < c 0 (dl + cr)T+ci 

Jt Jt 

(8.4.16) 

\/t > 0 and T > 0 where cq = max[ 1, |0*| 2 ],ci = 2 supjE (t) — V(t + T)]. 
Expression (8.4.16) implies that e\, \fo9 are (dg+cr) -small in the mean square 
sense (rn.s.s.), i.e., ei, \J~o9 G 5(c?q + a). Because 


rt+T 


\e\ 2 < 2^ 2 e{u 2 + 2 7 2 cr 2 6» 2 

it follows that 9 is also {d'f } + fj)-small in the rn.s.s., i.e., 9 G S{d q + a) 
Therefore, the ideal property (ii) extends to 


(ii)' ei,6 G S(dl + a) 


(8.4.17) 


The £2 property of ei, 9 can therefore no longer be guaranteed in the presence 
of the nonzero modeling error term ? 7 . The £2 property of ei was used in 
Section 4.2.1 together with u G Coo to establish that ei(f) — > 0 as t — > 00. It 
is clear from (8.4.17) and the analysis above that even when the disturbance 
r] disappears, i.e., 77 = 0 , the adaptive law with the er-modification given 
by (8.4.11) cannot guarantee that e\ ,9 G £2 or that e\ (t) — ► 0 as t — > 00 
unless a = 0. Thus robustness is achieved at the expense of destroying the 
ideal property given by (ii) in (8.4.3) and having possible nonzero estimation 
errors at steady state. This drawback of the cr-modification motivated what 
is called the switching cr-modification described next. 


(b) Switching-c r [91]. Because the purpose of the cr-modification is to avoid 
parameter drift, it does not need to be active when the estimated parameters 
are within some acceptable bounds. Therefore a more reasonable modifica- 
tion would be 


w(t) = a s , a. 


0 if |0| < M 0 
a 0 if | 0 | > Mo 
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where M$ > 0, 07 > 0 are design constants and Mo is chosen to be large 
enough so that Mo > \9* \ . With the above choice of a s . however, the adaptive 
law (8.4.9) is a discontinuous one and may not guarantee the existence of the 
solution 9(t). It may also cause oscillations on the switching surface \9\ = Mo 
during implementation. The discontinuous switching cr-modification can be 
replaced with a continuous one such as 

r o if |0(t)i < Mo 

w(t) = a s , a s = | (To - l) if M 0 < | 6 »(f)| < 2 M 0 (8.4.18) 

{ a 0 i£\9(t)\>2M 0 

shown in Figure 8.4 where the design constants Mo,ao are as defined in 
the discontinuous switching cr-modification. In this case the adaptive law is 
given by 

9 = yei u — r ya s 9 , e\ = y — 9u 

and in terms of the parameter error 

9 = 7 d u — 7 a s 9, e\ = — 9u + rj (8.4.19) 

The time derivative of V(9) = ^ along the solution of (8.4.18), (8.4.19) is 
given by 

2 h2 

V = -e\ + e\i] - a s 99 < - y - a s 99 + y (8.4.20) 

Now 

a s 99 = a s {9 - 9*)9 > a s \9\ 2 - a s \9\\9*\ 

> a s \9\(\9\-Mo + M 0 -\9*\) 



8.4. MODIFICATIONS FOR ROBUSTNESS: SIMPLE EXAMPLES 561 


i.e., 

a s ee > a a \e\(\e\ - m 0 ) + <t s \o\(m 0 - \e * |) > o (8.4.21) 

where the last inequality follows from the fact that a s > 0, cr s (|0| — Mo) > 0 
and Mq > |0*|. Hence, —o s 99 < 0. Therefore, in contrast to the fixed cr- 
modification, the switching a can only make V more negative. We can also 
verify that 

- a s 99 < -a 0 99 + 2 a 0 M$ (8.4.22) 

which we can use in (8.4.20) to obtain 

e 2 - d 2 

V < — y - a 0 99 + 2 a 0 M 0 2 + -j- (8.4.23) 

The boundedness of V and, therefore, of 9 follows by using the same pro- 
cedure as in the fixed a case to manipulate the term cjq99 in (8.4.23) and 
express V as 

V < -aV + 2 u 0 M 0 2 + ^ + u 0 & 

where 0 < a < U07 which implies that 9 converges exponentially to the 
residual set 

D s = 1 0 \~9 \ 2 < ^(^ + u 0 |r| 2 + 4uoM 2 )| 

The size of D s is larger than that of D a in the fixed er-modification case 
because of the additional term AgqMq. The boundedness of 9 implies that 
9, 9 , e\ e Coo and, therefore, property (i) of the unmodified adaptive law in 
the ideal case is preserved by the switching er-modification when ^ 0 but 
V ^ Coo- 

As in the fixed er-modification case, the switching a cannot guarantee 
the C 2 property of e\ ,9 in general when r/ 7^ 0. The bound for e\ in rn.s.s. 
follows by integrating (8.4.20) to obtain 

rt-\-T ~ rt-\-T 

2 / a s 09dr + / e\dr < d^T + c\ 

Jt Jt 
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for some constant c that depends on the bound for cro|0|, which implies that 
a s \0\ £ 5(c?o)- Because 

\9\ 2 < 2 r j 2 e 2 u 2 + 2 r ) 2 a 2 s \9\ 2 

it follows that \9\ £ S((Iq). Hence, the adaptive law with the switching-er 
modification given by (8.4.19) guarantees that 

(ii)' ei ,9eS(dl) 

In contrast to the fixed cr-modification, the switching a preserves the ideal 
properties of the adaptive law, i.e., when r/ disappears ( rj = do = 0), equation 

(8.4.20) implies (because —a s 96 < 0) that e\ £ £2 and \J a s 99 £ £ 2 , which, 
in turn, imply that 0 £ £ 2 - In this case if u £ £oo, we can also establish 
as in the the ideal case that ei(t), 0 (f) — > 0 as t —+ 00 , i.e., the switching 
a does not destroy any of the ideal properties of the unmodified adaptive 
law. The only drawback of the switching a when compared with the fixed 
ex is that it requires the knowledge of an upper bound Mq for |0*|. If Mq in 
(8.4.18) happens not to be an upper bound for \9*\, then the adaptive law 
(8.4.18) has the same properties and drawbacks as the fixed cr-modification 
(see Problem 8.7). 

(c) e\ -modification [172], Another attempt to eliminate the main drawback 
of the fixed cr-modification led to the following modification: 

w(t) = |ei \v 0 

where > 0 is a design constant. The adaptive law becomes 

9 = 'yeiu - ~i\ei\vo9, e x = y - 9u 

and in terms of the parameter error 

9 = jeiu — 7 |ei|ic o 0 , e\ = — 9u + ij (8.4.24) 

The logic behind this choice of w is that because in the ideal case t\ is 
guaranteed to converge to zero (when ii £ £ 00 ) > then the leakage term w(t)9 
will go to zero with e± when 77 = 0 ; therefore, the ideal properties of the 
adaptive law (8.4.24) when r/ = 0 will not be affected by the leakage. 
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by 


The time derivative of V(0) = along the solution of (8.4.24) is given 


V = -e\ + ei rj - \e\\v 0 60 < -|ei| f|ei| + ^077 - ^ 0^77 ^ (8.4.25) 


~ R 2 \ 0*\2 

where the inequality is obtained by using uq66 < + ^ 0^2 • ^ 

clear that for |ei| + ^077 > + ^o> be., for V > Vo = ^(z'o^j- + ^ 0 ) 

we have V < 0, which implies that V and, therefore, 6,6 G £00 • Because 
61 = — 6u + rj and u G £oo we also have that e\ G £oo, which, in turn, implies 
that 0 G £oo. Hence, property (i) is also guaranteed by the ei-modification 
despite the presence of rj 7 ^ 0 . 

Let us now examine the £2 properties of ei, 6 guaranteed by the unmod- 
ified adaptive law (w(t) = 0) when 77 = 0. We rewrite (8.4.25) as 




+ eiko 


I* 


2^2 


by using the inequality —a 2 ± a& < — \ + \. If we repeat the use of the 
same inequality we obtain 


e 2 d 2 

v < -4 + ^ + h 




1* 1 4 


(8.4.26) 


Integrating on both sides of (8.4.26), we establish that e\ G <S(dg + z/q). 
Because tz, 0 G £ 00 , it follows that |0| < c|ei| for some constant c > 0 and 
therefore 0 G 5(dg + z/q). Hence, the adaptive law with the ei-nrodification 
guarantees that 

(ii)' ei,0 G 5(dg + z/g) 

which implies that e\, 6 are of order of do, v 0 in rn.s.s. 

It is clear from the above analysis that in the absence of the disturbance 
i.e., 77 = 0, V cannot be shown to be negative definite or semidefinite unless 
vo = 0. The term \ei\uo66 in (8.4.25) may make V positive even when 77 = 0 
and therefore the ideal properties of the unmodified adaptive law cannot 
be guaranteed by the adaptive law with the ei-nrodification when 77 = 0 
unless uq = 0. This indicates that the initial rationale for developing the 
ei-nrodification is not valid. It is shown in [172], however, that if u is PE, 
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then ei(t) and therefore w(t) = vo\e\ (t)| do converge to zero as t — > oo when 
rj(t) = 0 ,Vt > 0. Therefore the ideal properties of the adaptive law can be 
recovered with the ei-nrodification provided u is PE. 


Remark 8.4.1 


(i) Comparing the three choices for the leakage term w(t), it is clear that 
the fixed a- and ei-nrodification require no a priori information about 
the plant, whereas the switching-a requires the design constant Mq to 
be larger than the unknown |0*|. In contrast to the fixed a- and ei- 
modifications, however, the switching a achieves robustness without 
having to destroy the ideal properties of the adaptive scheme. Such 
ideal properties are also possible for the ei-nrodification under a PE 
condition[172], 

(ii) The leakage — w9 with w(t) > 0 introduces a term in the adaptive law 
that has the tendency to drive 9 towards 9 = 0 when the other term 
(i.e., 761 u in the case of (8.4.9)) is small. If 9* / 0, the leakage term 
may drive 9 towards zero and possibly further away from the desired 
9*. If an a priori estimate 9* of 9* is available the leakage term — w9 
may be replaced with the shifted leakage —w(9 — 9*), which shifts the 
tendency of 9 from zero to 9*, a point that may be closer to 9*. The 
analysis of the adaptive laws with the shifted leakage is very similar to 
that of — w9 and is left as an exercise for the reader. 

(iii) One of the main drawbacks of the leakage modifications is that the es- 
timation error ei and 9 are only guaranteed to be of the order of the 
disturbance and, with the exception of the switching cr-modification, of 
the order of the size of the leakage design parameter, in rn.s.s. This 
means that at steady state, we cannot guarantee that e\ is of the order 
of the modeling error. The rn.s.s. bound of e± may allow ei to exhibit 
“bursting,” i.e., e\ may assume values higher than the order of the mod- 
eling error for some finite intervals of time. One way to avoid bursting 
is to use PE signals or a dead-zone modification as it will be explained 
later on in this chapter 
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(iv) The leakage modification may be also derived by modifying the cost 

'i _ 

2 2 


function J{9) = 'A = — ; used in the ideal case, to 


Using the gradient method, we now obtain 

9 = — yVJ = yeirt — yrc# 


(8.4.27) 


which is the same as (8.4.9). The modified cost now penalizes 9 in 
addition to e± which explains why for certain choices of w(t) the drifting 
of 9 to infinity due to the presence of modeling errors is counteracted. 


8.4.2 Parameter Projection 

An effective method for eliminating parameter drift and keeping the param- 
eter estimates within some a priori defined bounds is to use the gradient pro- 
jection method to constrain the parameter estimates to lie inside a bounded 
convex set in the parameter space. Let us illustrate the use of projection for 
the adaptive law 

9 = yei u, e\ = y — 9u 

We like to constrain 9 to lie inside the convex bounded set 



where Mq > |0*|. Applying the gradient projection method, we obtain 

f 7£i u if \9\ < Mo 

9 = 9=1 or if |#| = Mo and 9e\u < 0 (8.4.28) 

y 0 if \9\ = Mo and 9e\u > 0 

ei = y — 9u=—9u + ri 


which for 1 0(0) | < Mo guarantees that |0(f)| < Mo,Vt > 0. Let us now 
analyze the above adaptive law by considering the Lyapunov function 
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whose time derivative V along (8.4.28) is given by 
( — ef + ei rj if |0| < M 0 

V = < or if |0| = Mo and 9e\ u < 0 (8.4.29) 

y 0 if |0| = Mo and 9e\u > 0 

Let us consider the case when V = 0, \9\ = Mo and 9e\u > 0. Using the 
expression = — 0u + r/, we write V = 0 = — e 2 + e\ r/ — 9e\u. The last term 
in the expression of V can be written as 

9e\u = (9 — 9*)e\u = Mosgn(9)eiu — 9*e\u 
Therefore, for 9e\u > 0 and |0| = M$ we have 

9e\u = Mo|eirt| — 9*e\u > Mo|eiu| — |0*||ei«| > 0 


where the last inequality is obtained by using the assumption that Mo > 16**1, 
which implies that for \9\ = Mo and 9e\u > 0 we have 9e \u > 0 and that 
V = 0 < — ef + ei rj. Therefore, (8.4.29) implies that 

p 2 d 2 

V<-ej + e lV <-^ + f, Vt> 0 (8.4.30) 

A bound for e\ in rn.s.s. may be obtained by integrating both sides of 
(8.4.30) to get 


rt-\-T 

j e\dT<dlT + 2{V{t)-V{t + T)} 

\/t > 0 and any T > 0. Because V £ 0^, it follows that e\ £ S(d'f } ). 

The projection algorithm has very similar properties and the same ad- 
vantages and disadvantages as the switching cr-modification. For this reason, 
the switching cr-modification has often been referred to as “soft projection.” 
It is soft in the sense that it allows |6*(t)| to exceed the bound Mo but it does 
not allow \9(t,)\ to depart from Mq too much. 


8.4.3 Dead Zone 

Let us consider the adaptive error equation (8.4.6) i.e., 

9 = yei u, ei = —9u + rj 


(8.4.31) 
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(b) 


Figure 8.5 Dead zone functions: (a) discontinuous (b) continuous. 


Because sup t |r/(i)| < do, it follows that if |ei| S> do then the signal 0u 
is dominant in e\. If, however, |ei| < do then 7j may be dominant in e\. 
Therefore small e± relative to do indicates the possible presence of a large 
noise (because of rj) to signal (9u) ratio, whereas large e\ relative to do 
indicates a small noise to signal ratio. Thus, it seems reasonable to update 
the parameter estimate 6 only when the signal 9u in ei is large relative to 
the disturbance r] and switch-off adaptation when e\ is small relative to the 
size of r /. This method of adaptation is referred to as dead zone because 
of the presence of a zone or interval where 9 is constant, i.e. , no updating 
occurs. The use of a dead zone is illustrated by modifying the adaptive law 
9 = 7 d u used when rj = 0 to 

9 = 7 «(ei + g), <? = { ° j! M (8.4.32) 

( — ei it |ei| < go 

e\ = y — 9u 

where go > 0 is a known strict upper bound for |r/(f)|, i.e., go > do > 
sup t |?/(t)|. The function /(e i) = e\ + g is known as the dead zone function 
and is shown in Figure 8.5 (a). 

It follows from (8.4.32) that for small e\, i.e., |ei| < go, we have 0 = 0 
and no adaptation takes place, whereas for “large” ei, i.e., |ei| > go, we 
adapt the same way as if there was no disturbance, i.e., 9 = 761 u. Because 
of the dead zone, 9 in (8.4.32) is a discontinuous function which may give 
rise to problems related to existence and uniqueness of solutions as well as 
to computational problems at the switching surface [191]. One way to avoid 
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these problems is to use the continuous dead zone shown in Figure 8.5(b). 

Using the continuous dead-zone function from Figure 8.5 (b), the adap- 
tive law (8.4.32) becomes 



f 9o 

if ei < -g 0 


, a = \ 

, ~9o 

if ei > 9o 

(8.4.33) 

1 

{ 

if M < 9o 


ei = — 9u + 77 




which together with 


describe the stability properties of the modified adaptive law. We analyze 
(8.4.33) by considering the Lyapunov function 


v-t 

2y 


whose time derivative V along the trajectory of (8.4.33) is given by 

V = 9u(e i + g) = -(ei - rj)(e i + g) (8.4.34) 


Now, 

( (ei -77)(ei + 50 ) > 0 if ei < -go < -\rj\ 

(ei - 77 ) (ei + g) = < (ei - 77 ) (ei - go) > 0 if ei > g 0 > \rj\ 

[ 0 if |ei| < g 0 

i.e. , (ei — 77 ) (ei + g) > 0 Vf > 0 and, therefore, 

V = -(ei - 77 ) (e 1 + g) < 0 

which implies that V,9,9 G Cx and that \/(ei — 77 ) (ei + g) G £2 • The 
boundedness of 9,u implies that e\,9 G £ 00 • Hence, the adaptive law with 
the dead zone guarantees the property e \ , 0 G £00 in the presence of nonzero 
77 G £oo . Let us now examine the £2 properties of ei , 9 that are guaranteed 
by the unmodified adaptive law when 77 = 0. We can verify that 


(ei + g ) 2 < (ei -v)(o + g) 

for each choice of g given by (8.4.33). Since i/(ei — 77 ) (ei + g) G £2 it follows 
that (ei + 77 ) G £2 which implies that 9 G £ 2 . Hence, the dead zone preserves 
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the £2 property of 9 despite the presence of the bounded disturbance //. Let 
us now examine the properties of e\ by rewriting (8.4.34) as 

V = -e\- ei g + + gg 

< ~ e i + l^ilso + l e i|^o + dogo 


By completing the squares, we obtain 


V < 


4 {d 0 + g 0 )' 2 

2 + 2 


+ dogo 


which implies that e\ £ S(do + go)- Therefore, our analysis cannot guarantee 
that 61 is in £2 but instead that is of the order of the disturbance and 
design parameter <70 i n m.s.s. Furthermore, even when g = 0, i.e., do = 0 we 
cannot establish that e\ is in £2 unless we set go = 0. 

A bound for |ei| at steady state may be established when 11,77 are uni- 
forrnly continuous functions of time. The uniform continuity of u, 77, together 
with 9 £ £00, implies that = —9u + 77 is uniformly continuous, which, in 
turn, implies that e\ + g is a uniformly continuous function of time. Because 
e\ + g £ £2, it follows from Barbalat’s Lemma that |ei + g\ — > 0 as t — ► 00 
which by the expression for g in (8.4.33) implies that 


lim sup|ei(r)| < go (8.4.35) 

T>t 

The above bound indicates that at steady state the estimation error e\ is 
of the order of the design parameter go which is an upper bound for 1 77(f) |. 
Hence if go is designed properly, phenomena such as bursting (i.e., large 
errors relative to the level of the disturbance at steady state and over short 
intervals of time) that may arise in the case of the leakage modification and 
projection will not occur in the case of dead zone. 

Another important property of the adaptive law with dead-zone is that 
it guarantees parameter convergence, something that cannot be guaranteed 
in general by the unmodified adaptive laws when 77 = 0 unless the input u 
is PE. We may establish this property by using two different methods as 
follows: 

The first method relies on the simplicity of the example and is not ap- 
plicable to the higher order case. It uses the property of V = and V < 0 
to conclude that V(t) and, therefore, 9{t) have a limit as t — > 00. 



570 


CHAPTER 8. ROBUST ADAPTIVE LAWS 


The second method is applicable to the higher order case as well and 
proceeds as follows: 

We have 

V = -(ei - 77 ) (ei + g) = — |ei - ??||ei + g\ (8.4.36) 

where the second equality holds due to the choice of g. Because |ei + g\ = 0 
if | e i| < < 70 ) it follows from (8.4.36) that 

V = -| ei - 77 I | ei + g\<-\g 0 ~ | 7 /|||ei + g\ (8.4.37) 

Integrating on both sides of (8.4.37), we obtain 

roo 

inf bo - b(t)|| / bi + g\dr 
t Jo 

roc 

< / \ei - rj\\ei + g\dr = V (0) - Voo 

Jo 

Because go > sup t \rj(t)\, it follows that e\ +g £ C\. We have \9\ < 717 x 1161+51 
which implies that 9 £ C\ which in turn implies that 

lim [ 9dr = lim 9(t) — 9(0) 

t — >OC JQ t — >00 

exists and, therefore, 9(t) converges to some limit 9. 

The parameter convergence property of the adaptive law with dead zone 
is very helpful in analyzing and understanding adaptive controllers incorpo- 
rating such adaptive law. The reason is that the initially nonlinear system 
that represents the closed-loop adaptive control scheme with dead zone con- 
verges to an LTI one as t — » 00 . 

One of the main drawbacks of the dead zone is the assumption that an 
upper bound for the modeling error is known a priori. As we point out in 
later sections, this assumption becomes more restrictive in the higher order 
case. If the bound of the disturbance is under estimated, then the properties 
of the adaptive law established above can no longer be guaranteed. 

Another drawback of the dead zone is that in the absence of the distur- 
bance, i.e., i](t) = 0 , we cannot establish that €1 £ £2 and/or that ei(t) — » 0 
as t — > 00 (when u £ £oo) unless we remove the dead-zone, i.e., we set go = 0 . 
Therefore robustness is achieved at the expense of destroying some of the 
ideal properties of the adaptive law. 
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8.4.4 Dynamic Normalization 

In the previous sections, we design several modified adaptive laws to estimate 
the parameter 9* in the parametric model 

y = 9*u + y (8.4.38) 

when u 6 and 77 is an unknown bounded signal that may arise because 
of a bounded output disturbance etc. 

Let us now extend the results of the previous sections to the case where 
u is not necessarily bounded and ij is either bounded or is bounded from 
above by |w| i.e., 1 77 I < ci|u| + C 2 for some constants ci,C 2 > 0. In this case 
a normalizing signal such as m 2 = 1 + u 2 may be used to rewrite (8.4.38) as 

y = 9*u + fj 

where x = — denotes the normalized value of x. Since with m 2 = 1 + u 2 we 

m 

have y,u,fj £ Too, the same procedure as in the previous sections may be 
used to develop adaptive laws that are robust with respect to the bounded 
modeling error term fj. The design details and analysis of these adaptive 
laws is left as an exercise for the reader. 

Our objective in this section is to go a step further and consider the case 
where 77 in (8.4.38) is not necessarily bounded from above by |u| but it is 
related to u through some transfer function. Such a case may arise in the 
on-line estimation problem of the constant 9* in the plant equation 

y = 9*{l + A m {s))u + d (8.4.39) 

where A m (s) is an unknown multiplicative perturbation that is strictly 
proper and stable and d is an unknown bounded disturbance. Equation 
(8.4.39) may be rewritten in the form of (8.4.38), i.e., 

y = 9*u + r/, 77 = 9*A m (s)u + d (8.4.40) 

To apply the procedure of the previous sections to (8.4.40), we need to 
find a normalizing signal m that allow us to rewrite (8.4.40) as 


y = 9*u + 77 


(8.4.41) 
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where y = ^,u = ^,y = ^ and y, u,y <E Because of A m (s) in (8.4.40), 
the normalizing signal given by m 2 = 1 + u 2 is not guaranteed to bound y 
from above. A new choice for m is found by using the properties of the C 2 s 
norm presented in Chapter 3 to obtain the inequality 

\y{t)\ < |^* 1 1| A m (s) 1 1 2<5 II 1 1 25 + \d(t)\ (8.4.42) 

that holds for any finite 6 > 0 provided A m (s) is strictly proper and analytic 
in Re[s] > — If we now assume that in addition to being strictly proper, 
A m (s) satisfies the following assumption: 

Al. A m (s) is analytic in Re[s] > — ^ for some known So > 0 

then we can rewrite (8.4.42) as 

\v(t)\ < Po\\ut\\ 28 o + do (8.4.43) 

where yo = |0*|||A m (s)||2<5 o and do is an upper bound for \d(t)\. Because 
yoido are constants, inequality (8.4.43) motivates the normalizing signal m 
given by 

m 2 = 1 + u 2 + \\u t \\ls 0 (8.4.44) 

that bounds both u, y from above. It may be generated by the equations 

rh s = —5om s + u 2 , m s ( 0 ) = 0 

n 2 s = m. s , m 2 = 1 + u 2 + n 2 (8.4.45) 

We refer to m s = n 2 as the dynamic normalizing signal in order to distinguish 
it from the static one given by m 2 = 1 + u 2 . 

Because m bounds u, y in (8.4.40), we can generate a wide class of adap- 
tive laws for the now bounded modeling error term y = — by following the 
procedure of the previous sections. By considering the normalized equation 
(8.4.41) with m given by (8.4.45) and using the results of Section 8.3.1, we 
obtain 

6 = qef u — ywO (8.4.46) 

where 

_ - y - 

ei = y-y= — 

n 

and w is the leakage term to be chosen. 


y = Ou 
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As in the ideal case considered in Chapter 4, we can express (8.4.46) in 
terms of the normalized estimation error 

y-y - a 

e = y = 9u 

rrP 

to obtain 

0 = 7 eu — 7 wO (8.4.47) 

or we can develop (8.4.47) directly by using the gradient method to minimize 

e 2 m 2 w 0 2 (y ~ ® u ) 2 w ® 2 

m = — + — = 2m2 + — 

with respect to 9. 

Let us now summarize the main equations of the adaptive law (8.4.47) 
for the parametric model y = 9*u + y given by (8.4.40). We have 

8 = 'yeu — 'yw9, e =- — , y = 9u 

m 2 = l + u 2 + n^, m s = n 2 (8.4.48) 

rn s = -8om s + u 2 , m s ( 0) = 0 


The analysis of (8.4.48) for the various choices of the leakage term w(t) is 
very similar to that presented in Section 8.4.1. As a demonstration, we 
analyze (8.4.48) for the fixed er-modification, i.e. , w(t) = a. From (8.4.40), 
(8.4.48) we obtain the error equations 


- 9-11 + 11 ~ 

e = x — , 9 = 7 eu — 'yad 


(8.4.49) 


where 9 = 9 — 9*. We consider the Lyapunov function 


^ 2 7 

whose time derivative V along the solution of (8.4.49) is given by 

V = eOu — a99 = —e 2 m 2 + ey — a99 
By completing the squares, we obtain 

e 2 m 2 o8 2 if a\9*\ 2 

V < 1 1 ■ — — 

2 2 2m 2 2 


2 
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~ 2 

Because ^ G T oo it follows that V, 0 G T^ and em Because 

0 = 7 — r ya9 and “ G T^, we can establish as in Section 8.4.1 that 

9 G + <t). The boundedness of 9,^,^ also implies by (8.4.49) that 

e, em, 0 G Too- The properties of (8.4.48) with w = a are therefore given by 

• (i) e, em, 9, 9 G Too 

• (ii) e,em,/9 G S( ^ + a) 

In a similar manner, other choices for w(t) and modifications, such as 
dead zone and projections, may be used together with the normalizing signal 
(8.4.45) to design adaptive laws that are robust with respect to the dynamic 
uncertainty 7 . 

The use of dynamic normalization was first introduced in [48] to handle 
the effects of bounded disturbances which could also be handled by static 
normalization. The use of dynamic normalization in dealing with dynamic 
uncertainties in adaptive control was first pointed out in [193] where it was 
used to develop some of the first global results in robust adaptive control for 
discrete-time plants. The continuous-time version of the dynamic normal- 
ization was first used in [91, 113] to design robust MRAC schemes for plants 
with additive and multiplicative plant uncertainties. Following the work of 
[91, 113, 193], a wide class of robust adaptive laws incorporating dynamic 
normalizations are developed for both continuous- and discrete-time plants. 


Remark 8.4.2 The leakage, projection, and dead-zone modifications are 
not necessary for signal boundedness when u in the parametric model 
(8.4.1) is bounded and PE and 7 G Too as indicated in Section 8.3.1. 
The PE property guarantees exponential stability in the absence of 
modeling errors which in turn guarantees bounded states in the pres- 
ence of bounded modeling error inputs provided the modeling error 
term doesnot destroy the PE property of the input. In this case the 
steady state bounds for the parameter and estimation error are of the 
order of the modeling error, which implies that phenomena, such as 
bursting, where the estimation error assumes values larger than the 
order of the modeling error at steady state, are not present. The use 
of PE signals to improve robustness and performance is discussed in 
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subsequent sections. When u, q are not necessarily bounded, the above 
remarks hold provided — G is PE. 


8.5 Robust Adaptive Laws 


In Chapter 4 we developed a wide class of adaptive laws for estimating 
on-line a constant parameter vector 9* in certain plant parametric models. 
The vector 9* could contain the unknown coefficients of the plant transfer 
function or the coefficients of various other plant parameterizations. The 
adaptive laws of Chapter 4 are combined with control laws in Chapters 
6, 7 to form adaptive control schemes that are shown to meet the control 
objectives for the plant model under consideration. 

A crucial assumption made in the design and analysis of the adaptive 
schemes of Chapters 4 to 7 is that the plant model is an ideal one, i.e., it 
is free of disturbances and modeling errors and the plant parameters are 
constant for all time. 

The simple examples of Section 8.3 demonstrate that the stability prop- 
erties of the adaptive schemes developed in Chapters 4 to 7 can no longer 
be guaranteed in the presence of bounded disturbances, unmodeled plant 
dynamics and parameter variations. The main cause of possible instabili- 
ties in the presence of modeling errors is the adaptive law that makes the 
overall adaptive scheme time varying and nonlinear. As demonstrated in 
Section 8.4 using a simple example, the destabilizing effects of bounded dis- 
turbances and of a class of dynamic uncertainties may be counteracted by 
using simple modifications that involve leakage, dead-zone, projection, and 
dynamic normalization. In this section we extend the results of Section 8.4 
to a general class of parametric models with modeling errors that may arise 
in the on-line parameter estimation problem of a wide class of plants. 

We start with the following section where we develop several parametric 
models with modeling errors that are used in subsequent sections to develop 
adaptive laws that are robust with respect to uncertainties. We refer to such 
adaptive laws as robust adaptive laws. 
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8.5.1 Parametric Models with Modeling Error 
Linear Parametric Models 


Let us start with the plant 

y = G 0 (s)(l + A m (s))u + d (8.5.1) 

where 

0 -<*> - W) (8 ' 5 ' 2) 

represents the dominant or modeled part of the plant transfer function with 
Z(s) = b n -is n ~ l + b n -2S n ~ 2 + . . . + b\s + b 0 and R(s) = s n + a n _is n_1 + 
. . . + a\s + ao; A m (s) is a multiplicative perturbation and d is a bounded 
disturbance. 

We would like to express (8.5.1) in the form where the coefficients of 
Z(s),R(s) lumped in the vector 

b = [bn— 1 > b n — 2 f . . . ,b(j, a n — 1 , CLn— 2 ®o] 

are separated from signals as done in the ideal case, where A m = 0, d = 0, 
presented in Section 2.4.1. From (8.5.1) we have 

R.y = Zu + ZA m u + R/I (8.5.3) 


As in Section 2.4.1, to avoid the presence of derivatives of signals in the 
parametric model, we filter each side of (8.5.3) with where A(s) is a 
rnonic Hurwitz polynomial of degree n, to obtain 


R 


Z 


~pj = jU + Vm 


where 

ZA rn R 
V m = + ~^ d 

is the modeling error term because of A m , d. If instead of (8.5.1), we consider 
a plant with an additive perturbation, i.e., 


y = G 0 (s)u + A a (s)u + d 


we obtain 


R 


Z 
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where 


R A R , 

Va = a u + —d 


Similarly, if we consider a plant with stable factor perturbations, i.e., 

Nq(s) + Ai(s) 


V = 


Dq (s) + A 2 (s 


-u + d 


where 


Z(s) . . R(s) 
N o{s) = T77V > A)(«) = 


A(«)’ A(s) 

and Nq, Dq are proper stable transfer functions that are coprime, we obtain 


R 


Z 


A y = A“ + 


where 

R 

Vs = Am - A 2 y + (^ + ^2)d 


Therefore, without loss of generality we can consider the plant parameteri- 
zation 


fy=l u + v 


(8.5.4) 


where 

rj = A y (s)y + A u (s)u + d! (8.5.5) 


is the modeling error with A y , A u being stable transfer functions and d\ 
being a bounded disturbance and proceed as in the ideal case to obtain a 
parametric model that involves 6* . If we define 


A 
z = 




«n-l(») 

A / X U 

a (s) 


®n— 1 (®) 


where cti(s) = [s®, s* x , . . 

. , s, 1] T , as in Section 2.4.1, 

we can rewrite (8.5.4) 

in the form 


z = 9* r (j> + rj 

(8.5.6) 

or in the form 


y = 0*\ T (i> + V 

(8.5.7) 


where 0* X T = [6>J T , 6>| T - A T ] T , 9\ = [b n - 1 , . . . , 6 0 ] T , and 9\ = [a n _i, . . . , a 0 ] T ; 
A = [A n _i, . . . , Aq] t is the coefficient vector of A(s) — s n = A n _is n-1 -b • -+Ao. 
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The effect of possible nonzero initial conditions in the overall plant state- 
space representation may be also included in (8.5.6), (8.5.7) by following 
the same procedure as in the ideal case. It can be shown that the initial 
conditions appear as an exponentially decaying to zero term r/o , i.e. , 

2: = 0* T (j) + ?/ + r? 0 

y = 0\ T 4> + V + VO (8.5.8) 

where r/o is the output of the system 

uj = A c co, w(0) = loo 

r/o = CqU 

A c is a stable matrix whose eigenvalues are equal to the poles of A m (s) or 
A a (s), or Ai(s) and A 2 (s) and A(s), and the degree of ui is equal to the 
order of the overall plant. 

The parametric models given by (8.5.6) to (8.5.8) correspond to Param- 
eterization 1 in Section 2.4.1. Parameterization 2 developed in the same 
section for the plant with ?/ = 0 may be easily extended to the case of ?/ 7 ^ 0 
to obtain 

y = W m (s)9* x r + r/ 

^ = w m (s)d* T 'i/j + r] (8.5.9) 

where W m (s) is a stable transfer function with relative degree 1 and r/r = 

w- 1 ^. 

Parametric models (8.5.6), (8.5.7) and (8.5.9) may be used to estimate 
on-line the parameter vector 9* associated with the dominant part of the 
plant characterized by the transfer function Gq(s). The only signals available 
for measurements in these parametric models are the plant output y and the 
signals 4>, and z that can be generated by filtering the plant input u 
and output y as in Section 2.4.1. The modeling error term 77 due to the 
unmodeled dynamics and bounded disturbance is unknown, and is to be 
treated as an unknown disturbance term that is not necessarily bounded. 
If, however, A y ,A u are proper then d\,y,u € Too will imply that 77 € Too- 
The properness of A y ,A u may be established by assuming that Go(s) and 
the overall plant transfer function are proper. In fact A y , A u can be made 
strictly proper by filtering each side of (8.5.4) with a first order stable filter 
without affecting the form of the parametric models (8.5.6) to (8.5.9). 



8.5. ROBUST ADAPTIVE LAWS 


579 


Bilinear Parametric Models 

Let us now extend the bilinear parametric model of Section 2.4.2 developed 
for the ideal plant to the plant that includes a multiplicative perturbation 
and a bounded disturbance, i.e. , consider the plant 

y = G 0 (s)(l + A m (s))u + d (8.5.10) 

where 

G ° (s) = 

Rp{s) 

k p is a constant, R p (s) is rnonic and of degree n, Z p (s) is rnonic Hurwitz of 
degree m < n and k p , Z p , R p satisfy the Diophantine equation 

R P Q + kpZpP = ZpA (8.5.11) 

where 

Q(s) = s n_1 + q T a n - 2 (s), P(s) = p T a n _i(s) 

and A(s) is a rnonic Hurwitz polynomial of degree 2n— 1. As in Section 2.4.2, 
our objective is to obtain a parameterization of the plant in terms of the 
coefficient vectors q,p of Q,P, respectively, by using (8.5.11) to substitute 
for Z p (s), R p (s ) in (8.5.10). This parameterization problem appears in direct 
MRAC where q,p are the controller parameters that we like to estimate on- 
line. Following the procedure of Section 2.4.2, we express (8.5.10) in the 
form 

RpU — kpZpU kpZpAmU Rpd 

which implies that 


QRpU = kpZpQu + kpZ p QA m u + QR p d (8.5.12) 


From (8.5.11) we have QR P = Z p A — kpZ p P, which we use in (8.5.12) to 
obtain 

Z p (A k p P)y — kpZ p Qu -t - kpZpQ^^ m u -(- QR p d 

Because Z p (s),A(s) are Hurwitz we can filter both sides of the above equa- 
tion with 1 /(Z p A) and rearrange the terms to obtain 


V = k p ^y + kp^u + kpQ^ 


u + 


QRp i 

AZ P 
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Substituting for P(s) = p T a n _i(s), Q(s) = s n 1 + q T a n - 2 (s), we obtain 


v - m k 

V ~ A{s) p 


T a n-l('S) 

P -H(^ y + q 


T a n-2(s) S 
~U i 


n— 1 


A(s) 


A(s 


-u 


+ T] 


(8.5.13) 


where A(s) is a Hurwitz polynomial of degree n\ that satisfies 2n — 1 > n\ > 
n — 1 and 

kpQAm QRp, / q r i/(\ 

V = — A — (8.5.14) 

is the modeling error resulting from A m ,d. We can verify that 
are strictly proper and biproper respectively with stable poles provided the 
overall plant transfer function is strictly proper. From (8.5.13), we obtain 
the bilinear parametric model 


y = W (s) p* (0* T (j) + z 0 ) + r] (8.5.15) 


where p* = k p , 


W{s) 


A W 
A{8) 


is a proper transfer function with stable poles and zeros and 


9* = [q T ,p T ] T , </» 


( s ) «,T-i(s) 1 

A(s) A(s) 


£0 


„n—l 


A(f 


If instead of (8.5.10), we use the plant representation with an additive plant 
perturbation, we obtain (8.5.15) with 


A — k p P 

V = -^—{AaU + d) 

and in the case of a plant with stable factor perturbations Ai, A 2 we obtain 
(8.5.15) with 


V = 


A Q 
AZ p 


(Ai u - A 2 y) + 


A Q 

AZp 



d 


where A is now restricted to have degree n. If we assume that the overall 
plant transfer function is strictly proper for the various plant representations 
with perturbations, then a general form for the modeling error term is 


r] = A U u + A y y + d\ 


(8.5.16) 
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where A u (s), A J/ (s) are strictly proper with stable poles and d\ is a bounded 
disturbance term. 


Example 8.5.1 Consider the following plant 

_ rs b ± s + b 0 1 - ps 

y = e -= u 

s z + ais + ao 1 + ps 

where p > 0, r > 0 are small constants. An approximate model of the plant is 
obtained by setting /i,r to zero, i.e., 


y = G 0 (s)u 


bis + b 0 
s 2 + ais + ag 


Treating Go(s) as the nominal or modeled part of the plant, we can express the 
overall plant in terms of Gq(s) and a multiplicative plant uncertainty, i.e., 


bis + bg 


^ s 2 + ais- 


a 0 


-(1 + A m (s))u 


where 


A m(^) 


(e~ TS - 1) - ns( 1 + e~ TS ) 
1 + y,s 


Let us now obtain a parametric model for the plant in terms of the parameter vector 
9* = [bi, bo, a\, ao] T . We have 


( s 2 + ais + a 0 )y = (bis + b 0 )u + (&is + b 0 ) A m ( s)u 


i.e., 


s 2 y = (b±s + b 0 )u - (ais + a 0 )y + ( bis + b 0 )A m (s)i 


Filtering each side with w e obtain 

z = 0* T (j) + y 


where 


2 = 


(s + 2) 


r2A <P = 


{s + 2) 2_U ’ (s + 2) 2 (s + 2) 2 y ’ (s + 2) 2; 


T u,~ 


V = 


(bis + b 0 ) [(e TS — 1) — /xs(l + e 


(s + 2) 2 


(1 + ns) 


V 


In the following sections, we use the linear and bilinear parametric models 
to design and analyze adaptive laws for estimating the parameter vector 0* 
(and p * ) by treating i) as an unknown modeling error term of the form given 
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by (8.5.16). To avoid repetitions, we will consider the general parametric 
models 

z = W(s)6* T i/> + ri (8.5.17) 

z = W(s)p*[9* T ip + z 0 ] + v (8.5.18) 

rj = A u (s)u + A y (s)y + d\ (8.5.19) 

where W(s) is a known proper transfer function with stable poles, and 
zo are signals that can be measured and A u ,A y are strictly proper stable 
transfer functions, and d\ £ Coo- 

We will show that the adaptive laws of Chapter 4 developed for the 
parametric models (8.5.17), (8.5.18) with rj = 0 can be modified to handle 
the case where r/ ^ 0 is of the form (8.5.19). 

8.5.2 SPR-Lyapunov Design Approach with Leakage 

Let us consider the general parametric model (8.5.17) which we rewrite as 

z = W(s)(9* T ^ + W~ 1 (s)v) 

and express it in the SPR form 

z = W(s)L(s)(e* T (/) + r}s) (8.5.20) 

where (J)= L~\s)^, rjs = L~\s)W~\s)v and L(s) is chosen so that W(s)L(s) 
is a proper SPR transfer function and L _1 (s) is proper and stable. The 
procedure for designing an adaptive law for estimating on-line the constant 
vector 9* in the presence of the unknown modeling error term 

Vs = L~ 1 (s)W~ 1 (s)(A u (s)u + A y (s)y + di) (8.5.21) 

is very similar to that in the ideal case (r/ s = 0) presented in Chapter 4. As 
in the ideal case, the predicted value z of z based on the estimate 9 of 9* 
and the normalized estimation error e are generated as 

z = W{s)L(s)9 T (j), e = z- z-W(s)L(s)en 2 s (8.5.22) 

where vn? = 1 + n 2 is the normalizing signal to be designed. From (8.5.20) 
and (8.5.22), we obtain the error equation 


e = WL[— 9 T cj) — en 2 + rjs] 


(8.5.23) 
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where 9 = 9 — 6*. Without loss of generality, we assume that WL is strictly 
proper and therefore (8.5.23) may assume the following minimal state rep- 
resentation 

e = A c e + B c [-9 T (j) - en 2 + r) s ] 
e = Cje. 

Because Cj ( sI—A c )~ 1 B c = W(s)L(s) is SPR, the triple (A c , B c , C c ) satisfies 
the following equations given by the LKY Lemma 

P C A C + AjP c = -qq T - is c L c 
P C B C = C c 


where P c = Pj > 0, q is a vector, u c > 0 is a scalar and L c = Lj > 0. The 
adaptive law is now developed by choosing the Lyapunov-like function 


V(e,0) 


e T P c e 9 t T~ 1 9 

— H 

2 2 


where T = T t > 0 is arbitrary, and designing 9 = 9 so that the properties 
of V allow us to conclude boundedness for V and, therefore, e, 9. We have 

V = — ^e T qq r e — ^-e T L c e — 9 T 4>e — e 2 n 2 + er/ s + 9 t T~ 1 9 (8.5.24) 


If r] s = 0, we would proceed as in Chapter 4 and choose 9 = Tecj) in order to 
eliminate the indefinite term — 0 T </>e. Because rj s ^ 0 and r] s is not necessarily 
bounded we also have to deal with the indefinite and possibly unbounded 
term er) s in addition to the term —9 T (f>e. Our objective is to design the 
normalizing signal m 2 = 1 + n 2 to take care of the term eij s . 

For any constant a £ (0, 1) to be specified later, we have 


2 2 , 2 2 i 2,2 

— e n s + erj s = —en s + er) s — ae + ae 


\ 2 2 i 2 2 2 , 

= —(1 — a)e n s + ae — ae m + ei] s 

= —(1 — a)e 2 n 2 + ae 2 — a(em — 


Vs 


2 am. 4am 2 


< —(1 — a)e 2 n 2 + ae 2 + 

which together with |e| < | C7 c 1 1 e | and 


Vs 


4 am. 2 


Vc T t ^ V c \min / T , i2 ^ ^min(Lc) o 

~J e 


4 
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imply that 

2 

V < — A 0 1 e| 2 - (/?o - a)e 2 - (1 - a)e 2 n 2 + “ ~° T ^ 

where Ao = ^ A m i n (L c ) , /3o = Ao/|C c | 2 . Choosing a = rnin( ! 2 , /3q), we obtain 

V < -A 0 |e| 2 - \e 2 n 2 s + - 0 T 0e + Fr" 1 # 

2 4am z 

If we now design the normalizing signal m 2 = 1 + n 2 so that r] s /m G /loo (in 
addition to 4>/m G Too required in the ideal case), then the positive term 
rjl/Iam 2 is guaranteed to be bounded from above by a finite constant and 
can, therefore, be treated as a bounded disturbance. 

Design of the Normalizing Signal 

The design of m is achieved by considering the modeling error term 
Vs = lT _1 (s)L _1 (s)[A„(s)« + A y (s)y + dt] 
and the following assumptions: 

(Al) W 1 (s)L 1 (s)is analytic in Refs] > — ^ for some known So > 0 
(A2) A u ,A y are strictly proper and analytic in Refs] > — ^ 

Because WL is strictly proper and SPR, it has relative degree 1 which 
means that W~ 1 L~ 1 A U , W~ l L~ x A y are proper and at most biproper trans- 
fer functions. Hence rj s may be expressed as 

Vs = ciu + c 2 y + Ai (s)u + A 2 (s)y + d 2 

where ci + Ai(s) = lT _1 (s)L _1 (s)A u (s), c 2 + A 2 (s) = lT' 1 (s)L _1 (s)A ?; (s), 
d 2 = W^ 1 (s)L~ 1 (s)d\ and Ai, A 2 are strictly proper. Using the properties 
of the £ 2< 5 -norm from Chapter 3, we have 


M*)| < c(\u(t)\ + \y{t)\) + ||Ai(s)|| 2( 5 0 |M| 2( 5 0 

+ ||A 2 (s)|| 25o ||yt|| 25o + \d 2 (t)\ 
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Because || Ai (s) 1 1 2 < 5 0 , 1 1 A 2 (s) 1 1 25 0 are finite and cfo £ Coo, it follows that the 
choice 

m 2 = l + n 2 s , n 2 s = m s + (f 1 (j) + u 2 + y 2 

m s = -5 0 m s + u 2 + y 2 , m s ( 0) =0 


will guarantee that r\ s /m e C^ and (f>/rn G C^. 

With the above choice for m, the term y 2 / (4am 2 ) € /loo which motivates 
the adaptive law 

d = Te<i>- vj(t)T6 (8.5.26) 

where T = T t > 0 is the adaptive gain and w(t) > 0 is a scalar signal to 
be designed so that V < 0 whenever V >Vq for some Vo > 0. In view of 
(8.5.26), we have 


V< -A 0 |e| 2 


1 


Vs 


-eW,-u,e T e+ w 


(8.5.27) 


which indicates that if w is chosen to make V negative whenever \6\ ex- 
ceeds a certain bound that depends on y 2 /(4 am 2 ), then we can establish 
the existence of Vq > 0 such that V < 0 whenever V > Vq. 


The following theorems describe the stability properties of the adaptive 
law (8.5.26) for different choices of w(t). 

Theorem 8.5.1 (Fixed a Modification) Let 


w(t) = cr > 0, Vi > 0 


where a is a small constant. The adaptive law (8.5.26) guarantees that 


(i) 9,e € Coo 

(ii) e,en s ,6 e S(a + y 2 /m 2 ) 


(iii) In addition to properties (i), (ii), ifn s , <fi, 4> 6 Coo, and 4> is PE with level 
ao > 0 that is independent of r/ s , then the parameter error 6 = 9 — 9* 
converges exponentially to the residual set 


D , J = 


\9\ < c(a + rj) | 


where c £ TZ + depends on oq and r/ = sup t \y s 
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Proof We have 


and 


-a6 l e < -er( 0 ' e- \o\ | 0 *|) < \e\ 2 + ^\9*\ 2 


-ae'e<-“- | 0 | 2 + ^|0*| 2 


Hence, for w = a, (8.5.27) becomes 


, 2^,2 _ ^2 1 / 3 * 

e n c a, ~ l9 T] s \u 


V < — A 0 |e| 2 — — — 10| 2 + c 0 -^T + ct- 




where Co = j- or 


(8.5.28) 


U < — f3V + cor] m + a -LJ — h (3 


e r P r e 


2 , -A o |e| 2 --|0| 2 (8.5.29) 


A 


where ry m = sup t and (3 is an arbitrary constant to be chosen. 

Because e T P c e+ 9 t T~ 1 9 < |e| 2 A maa: (P c ) + |0| 2 A maa ,(r _1 ), it follows that for 


/ 3 = min 


2An 


^ max {Pc) A maa: (r 1 ) 


(8.5.29) becomes 

in 2 

V < -(3V + c 0 rjm + a^— 

Hence, for V > Vo = 2cof i™+°\ e I ^ y < q which implies that V £ and therefore 
e, e, 9, 9 £ Loo- 


We can also use —a9 T 9 < — | \9\ 2 + ||0*| 2 and rewrite (8.5.27) as 


U< -A 0 |e| 2 - 


op 

2 2 


2 
Vs 
Co 2 


-\9* 


(8.5.30) 


Integrating both sides of (8.5.30), we obtain 

ct 


'to 


^o|c| H ^ + -|^| dr < 


'to 


* X Cn^- 
Co o 

m z 


\ 6 * 


dr + V(t 0 ) -V(t) 


Vt > to and any to > 0. Because V £ C , oo and e = Cje, it follows that e,e, en s , 
v^|0| € 5 (ct + rjl/m 2 ). Using the property — € Uoo, it follows from (8.5.26) with 
w = a that 

\9\ 2 < c{\em\ 2 + a\9\ 2 ) 

for some constant c > 0 and, therefore, e, en s , \/a\9\ £ S{a + ij 2 /m 2 ) and 9 £ C <*, 
imply that em and 9 £ S{a + ij 2 /m 2 ). 
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To establish property (iii), we write (8.5.23), (8.5.26) as 
e = A c e + B C {-6 T (f> - en 2 s ) + B c rj s 

0 = Tecf> — aT 9 (8.5.31) 

£ = Cje 

Consider (8.5.31) as a linear time-varying system with r/ s , ad as inputs and e, 9 as 
states. In Section 4.8.2, we have shown that the homogeneous part of (8.5.31) with 
g s = 0,<7 = 0 is e.s. provided that n a ,<f>, (f> £ £oo and f is PE- Therefore, the state 
transition matrix of the homogeneous part of (8.5.31) satisfies 

\Mt,t 0 )\\ < Pie- 02{t ~ to) (8.5.32) 

for any f, f 0 > 0 and some constants /3i , /?2 > 0. Equations (8.5.31) and (8.5.32) 
imply that 9 satisfies the inequality 

\9\ < Poe~ ,ht + /3 X f + a\0\)dT (8.5.33) 

Jo 

for some constants /3q,/3 1 > 0, provided that the PE property of <f> is not destroyed 
by iis ^ 0. Because 9 £ £oo, which is established in (i), and <j> £ C oo, it follows from 
(8.5.33) that 

\0\ < I3 o e~ 02t + ^~(v + a sup |0|) 

P2 t 

where fj = sup 4 \r) s (t)\. Hence, (iii) is proved by setting c = ^maa:{l,sup t |0|}. □ 


Theorem 8.5.2 (Switching a) Let 

f 0 if \e\ < Mo 

w(t) = a s , cr s = l - l ) 90 CT 0 if Mo < \9\ < 2M 0 
( a () if \6\ > 2M 0 

where qo > 1 is any finite integer and ao,Mo are design constants with 
Mo > 1 and do > 0. Then the adaptive law (8.5.26) guarantees that 

(i) 9, e £ Coo 

(ii) e, en s , 9 £ S(rjg/m 2 ) 


(iii) In the absence of modeling errors, i.e., when rj s = 0, property (ii) can 
be replaced with 
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(ii') e, en s , 9 £ C 2 . 

(iv) In addition, if n s ,(/>,(f> £ C 0 0 and (j) is PE with level ao > 0 that is 
independent ofr] s , then 

(a) 9 converges exponentially to the residual set 


D, = 


[9 \9\ < c(a 0 + ??) } 


where c £ IZ + and r] = sup t |r/ s | 

(b) There exists a constant 77 * > 0 such that for fj < fj* , the parameter 
error 9 converges exponentially fast to the residual set 


D» = 


\9\ < c?y| 


Proof We have 

a s e T e = *„(\0\ 2 - 0* r 6) > a s \9\{\9\ - M 0 + M 0 - \9 * |) 
Because cr s {\9\ — M 0 ) > 0 and M 0 > |0*|, it follows that 

o s 9 T 6 > a s \e\(\e\ - Mo) + CJ S \6\{M 0 - |r I) > <j s \9\(M 0 - \9* I) > 0 


i.e., 

7 \ r c\ 

^ ~ (8 ' 5 ' 34) 

The inequality (8.5.27) for V with w = o s can be written as 

V<-Xo\e\ 2 - e -^-a s e T 9 + c 0 ^ (8.5.35) 

z m z 

Because for |0| = |0 + 0*| > 2M 0 , the term —a s 9 T 9= — ao6 T 9 < — ^-\9\ 2 + ^\9*\ 2 
behaves as the equivalent fixed-cr term, we can follow the same procedure as in the 
proof of Theorem 8.5.1 to show the existence of a constant Vo > 0 for which V < 0 
whenever V > Vq and conclude that V, e, e,9,9 £ Coo. 

Integrating both sides of (8.5.35) from to to t, we obtain that e, e, en s , em, 
\fcrjH9 £ S(gl/m 2 ). From (8.5.34), it follows that 

a 2 s \9\ 2 < c 2 o s 9 T 9 

for some constant c 2 > 0 that depends on the bound for o 0 \9\, and, therefore, 

\6 \ 2 < c(|em | 2 + cr s 9 T 9), for somec € IZ + 
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Because cm, \J a s 9 T 9 G S(r] 2 /m 2 ), it follows that 9 G S(ri 2 /m 2 ). 

The proof for part (iii) follows from (8.5.35) by setting r] s = 0, using —a s 9 T 9 < 0 
and repeating the above calculations for r/ s = 0. 

The proof of (iv)(a) is almost identical to that of Theorem 8.5.1 (iii) and is 
omitted. 

To prove (iv)(b), we follow the same arguments used in the proof of Theorem 
8.5.1 (iii) to obtain the inequality 

\0\ < f3 0 e~ f32t [ e~ f32 ^ t ~ T \\r]s\ + cr s \0\)dT 

Jo 

rt 

— ftn ( i — -r'l i n\ , 

It 


< Po e- p2t +^fj + p[ [ e- 0 ^- T) a s \6\d7 

P 2 Jo 

From (8.5.34), we have 

<T S \0\ < 


-a s 9 T 6 < 


M-\e*\ ~ m - \e 

Therefore, using (8.5.37) in (8.5.36), we have 

rt 


^s\e\ \e\ 


(8.5.36) 


(8.5.37) 


(8.5.38) 


\e\<Poe- 02t +^fj + P'i [ \9\dr 

P2 Jo 

where Applying B-G Lemma III to (8.5.38), it follows that 

\9\ < {Po + ^fpe-^-^e 1 f*o asl0lds +0^ f e-Mt-r)/' JW s \e\ds dr (8 5 39) 
P2 Jt 0 

Note from (8.5.34), (8.5.35) that ^a s \6\ G S(r/ 2 /m 2 ), i.e., 

f <Ts\9\dT < Cifj 2 {t - to) + Co 

Jtn 


'to 

Vi > to > 0 and some constants cq,Ci. Therefore, 


\6\ < ^ie- 5(t "‘ o) + p 2 fj f e"“ (t - T W 

Jto 


(8.5.40) 


where a = p 2 — P 2 c±r] 2 and Pi,p 2 > 0 are some constants that depend on Co and 
the constants in (8.5.39). Hence, for any fj G [0,77*), where fj* = we have 

a > 0 and (8.5.40) implies that 


\9\ < — fj + ce" a(t - to) 
a 

for some constant c and for all t > to > 0. Therefore the proof for (iv) is complete. 

□ 
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Theorem 8.5.3 (e-Modification) Let 

w(t) = \e 7 n\ 1 so 

where uo > 0 is a design constant. Then the adaptive law (8.5.26) with 
w(t) = \em)no guarantees that 

(i) 0, e G Too 

(ii) e,en s ,9 e S(n 0 + ^/m 2 ) 

(iii) In addition, if n s ,(j>,(f) G Too and <t> Is PE with level «o > 0 that is 
independent of rj s , then 9 converges exponentially to the residual set 

D e = {(9 \e\ < c(u 0 + fj ) | 

where c G IZ + and fj = sup t |r/ s |. 

Proof Letting w(t) = |e?n|^o and using (8.5.26) in the equation for V given by 
(8.5.24), we obtain 

V < ~v c e — cC — e 2 n 2 + |em|-^- — \em\v 0 9 T 9 
2 TO 

Because — 9 T 9 < — it follows that 

V < — 2A 0 |e| 2 - e 2 n 2 - \em\ ^ (8.5.41) 
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By completing the squares and using the same approach as in the proof of The- 
orem 8.5.1, we can establish that e,en s ,em G S(v o + rj 2 /m 2 ), which, together 
with \6\ < ||r|||em|^ + z'o||r|||em||0| < c|em| for some c G 1Z + , implies that 
0 G S(V 0 + V 2 s /m 2 ). 

The proof of (iii) is very similar to the proof of Theorem 8.5.1 (iii) , which can 
be completed by treating the terms due to rj s and the e-modification as bounded 
inputs to an e.s. linear time-varying system. □ 

Remark 8.5.1 The normalizing signal m given by (8.5.25) involves the dy- 
namic term m s and the signals 4>, u, y. Under some conditions, the 
signals (/> and/or u,y do not need to be included in the normalizing 
signal. These conditions are explained as follows: 

(i) If <f> = H(s)[u,y\ T where H(s) has strictly proper elements that are 
analytic in Re[s] > —Sq/2, then 1+ < ^ G and therefore the term 
<j) T (j) in the expression for n 2 can be dropped. 

(ii) If W(s)L(s) is chosen to be biproper, then W~ 1 L~ 1 A U , W~ 1 L~ 1 A y 
are strictly proper and the terms u 2 , y 1 in the expression for n 2 s can be 
dropped. 

(iii) The parametric model equation (8.5.20) can be filtered on both sides 
by a first order filter s ^ q where /o > ^ to obtain 

Zf = W (s)L(.s)(9* t (j)f + rjf) 

/\ r 

where Xf = denotes the filtered output of the signal x. In this 

case 

rjf = L~ 1 (s)IU~ 1 (s) [A u (s)u + A y (s)y + di] 

s + Jo 

is bounded from above by m 2 given by 

2 i i 2 2 

m = l + n s , n s = m s , 

m s = -6 0 m s + u 2 + y 2 , m s ( 0) = 0 

The choice of m is therefore dependent on the expression for the mod- 
eling error term 77 in the parametric model and the properties of the 
signal vector (j). 
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Remark 8.5.2 The assumption that the level ao > 0 of PE of (j) is indepen- 
dent of Tj s is used to guarantee that the modeling error term ij s does 
not destroy or weaken the PE property of <j). Therefore, the constant 
/?2 in the bound for the transition matrix given by (8.5.32) guaranteed 
to be greater than zero for rj s = 0 is not affected by rj s ^ 0. 


8.5.3 Gradient Algorithms with Leakage 


As in the ideal case presented in Chapter 4, the linear parametric model 
with modeling error can be rewritten in the form 

z = d* T (j) + ri, 7] = A u (s)u + A y (s)y + d\ (8.5.42) 


where ej) = W{s)^. The estimate z of z and the normalized estimation error 
are constructed as 



z — z 

m 2 


z — 9 t 4> 

m 2 


(8.5.43) 


where 9 is the estimate of 9* and m 2 = 1 + n 2 and n s is the normalizing 
signal to be designed. For analysis purposes, we express (8.5.43) in terms of 
the parameter error 9 = 9 — 9*, i.e., 


If we now design m so that 


—9^(f) + r] 

(8.5.44) 

( 9 

m- 

± il p r 

m i m. c -‘-'oo 

(Al) 


then the signal em is a reasonable measure of the parameter error 9 since 
for any piecewise continuous signal 4> an d rj, large em implies large 9. 


Design of the Normalizing Signal 

Assume that A u (s),A y (s) are strictly proper and analytic in Re[s] > — So/2. 
Then, according to Lemma 3.3.2, we have 

< IIA^HmoMmo + ll A y('S)||25ol|yt||25o + ll d l||25 0 

which motivates the following normalizing signal 

m 2 = 1 + n 2 , n 2 = m s + (j) T 4> 

= -5 0 m, s + u 2 + y 2 , m s ( 0) = 0 


m 


(8.5.45) 
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that satisfies assumption (Al). If <f> = H(s)[u, y] T where all the elements of 
H(s) are strictly proper and analytic in Re[s\ > — y, then 1+ ^ 6 C ^ and 
the term 0 T </> can be dropped from the expression of n 2 without violating 
condition (Al). 

If instead of being strictly proper, A u , A y are biproper, the expression 
for n 2 s should be changed to 

n 2 s = 1 + u 2 + y 2 + (j) T (j) + m s 


to satisfy (Al). 

The adaptive laws for estimating 9* can now be designed by choosing 
appropriate cost functions that we minimize w.r.t. 9 using the gradient 
method. We start with the instantaneous cost function 


J(M = 


e 2 m 2 


+ 


wit ) , 


|T / 


(z~ 

2 m 2 


T < t >) 2 + 


where w(t) > 0 is a design function that acts as a weighting coefficient. 
Applying the gradient method, we obtain 


9 = -TVJ = Te<t>-wT0 (8.5.46) 

em 2 = z — 8 T (j) = —8 T (j) + rj 


where T = T t > 0 is the adaptive gain. The adaptive law (8.5.46) has the 
same form as (8.5.26) except that e and 4> are defined differently. The weight- 
ing coefficient iv(t) in the cost appears as a leakage term in the adaptive law 
in exactly the same way as with (8.5.26). 

Instead of the instantaneous cost, we can also use the integral cost with 
a forgetting factor (3 > 0 given by 


jm = i j‘ [z(T> + i w(t )a T (t)o(t) 


where w(t) > 0 is a design weighting function. As in Chapter 4, the appli- 
cation of the gradient method yields 


= -rv j = r 


a -P(t-r) l Z i T ) ~ ° T jt)(l>(T)] 
m 2 (r) 


( t>{r)dT 


Tw8 
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which can be implemented as 

9 = -T(R6 + Q) -TwO 

R = -/3-R+^r, R( 0) = 0 (8.5.47) 

m z 

Q = -PQ-% Q(o) = o 

where R £ IZ nxn , Q £ IZ nxl and n is the dimension of the vector <f. 

The properties of (8.5.46), (8.5.47) for the various choices of w(t) are 
given by the following Theorems: 

Theorem 8.5.4 The adaptive law (8.5.46) or (8.5.47) guarantees the fol- 
lowing properties: 

(A) For the fixed-cr modification, i.e., w(t) = a > 0, we have 


(i) e,en s ,0,9 <E Coo 

(ii) e,en s ,9 £ S(a + rj (i) 2 /m 2 ) 

(iii) If n s ,4> £ Coo a,nd 4> is PE with level ao > 0 independent of g, 
then 9 converges exponentially to the residual set 


D a 


1 9 \9\ < c(a + 



where g = sup t 


\m\ 

m(t) 


and c> 0 is some constant. 


(B) For the switching-er modification, i.e., for w(t) = a s where a s is as 
defined in Theorem 8.5.2, we have 


(i) e,en s ,9,9 <E Coo 

(ii) e,en s ,9 £ S{if/m 2 ) 

(iii) In the absence of modeling error, i.e., for i] = 0, the proper- 
ties of (8.5.46), (8.5.47) are the same as those of the respective 
unmodified adaptive laws ( i.e., with w = 0) 



8.5. ROBUST ADAPTIVE LAWS 


595 


(iv) If n s ,4> G Cm and <t> PE with level ao > 0 independent of rj, 
then 

(a) 6 converges exponentially fast to the residual set 

E> s = \0 \9\ < c(cr 0 + ??) | 
where c > 0 is some constant. 

(b) There exists an fj* > 0 such that if fj < fj* , then 6 converges 
exponentially to the residual set D s = Id ||0| < cij\. 


Proof We first consider the gradient algorithm (8.5.46) based on the instantaneous 
cost function. The proof of (i), (ii) in (A), (B) can be completed by using the 
Lyapunov- like function V{9) = — ^ — - and following the same procedure as in the 
proof of Theorems 8.5.1, 8.5.2. 

To show that the parameter error converges exponentially to a residual set for 
persistently exciting </>, we write (8.5.46) as 

'e = -r^e+ ^ - w (t)re (8.5.48) 

m~ m z 

In Section 4.8.3 (proof of Theorem 4.3.2 (iii)), we have shown that the homogeneous 
part of (8.5.48) with g = 0 and w = 0 is e.s., i.e., the state transition matrix of the 
homogeneous part of (8.5.48) satisfies 

II^Mo)!! </?i e-^-to) 
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Using (8.5.51) in (8.5.50), we have 


\8\ < ^ + & f e-^‘- r V s \9\ \e\dr (8.5.52) 

P2 JO 

with /3j = mJ-\ e*| • Applying B-G Lemma III, we obtain 

| 0 | < e-^-^e 0 " £ aMdS + fa J* It "° Wds dr 

(8.5.53) 

Now consider the Lyapunov function V(9) = A-L. — Along the solution of the 
adaptive law given by (8.5.46) with w = cr s , we have 


v=-V^ + 1 r± _ ^ 


(8.5.54) 


for some constant c > 0, where the second inequality is obtained by using the 
properties that 6, A € £oo- Equations (8.5.51) and (8.5.54) imply that 


0's 1^1 < 


cry — V 


M 0 -\e*\ 

which, together with the boundedness of V, leads to 

f a s \9\dr < Cirj(t — to) + Cq (8.5.55) 

Jto 

for all t > t 0 > 0 and for some constants Ci, Co- Using (8.5.55) in (8.5.53), we have 
\0\ < ^ 0 e _5(t_to) + fa [ e"“ (t - T W 

Jtn 


(8.5.56) 


where a = @2 — Pi Ciry. Therefore for ry* = » 2 and ry < ry*, we have a > 0 and 

Pi C 1 

w<^+„ 

a 

where e* is an exponentially decaying to zero term and the proof of (B)(iv) is 
complete. 

The proof for the integral adaptive law (8.5.47) is different and is presented 
below. From (8.5.47), we can verify that 

Q(t) = -R(t)6* - f e-0WA^S- v (r)dr 

Jo m i T ) 
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and therefore 


9 = 9 = -r R9 + T / e 




2 (r) 


dr — Tred 


Let us now choose the same V(9) as in the ideal case, i.e., 

v(») - ^ 


(8.5.57) 


Then along the solution of (8.5.57), we have 

V = -9 T R9 + 9 T {t) [ e -/3(*-r) d T - w 6 t 9 

Jo m\T) 


< -9' R9 + 

-w9 T 9 


n - 0( t- T) (9 T (f)<Xu)) 2 

to 2 (t) 


V 2 (r) 


m 2 (r) 


dr 


(8.5.58) 


where the inequality is obtained by applying the Schwartz inequality. Using 

(rw^T)) 2 = 9 T (mr)t T (T)m 


and the definition of the norm || • || 2( 5 , we have 


V < ~9 r R9 + (9 r R9)i ( ' ), 

TO 


-w 9 t 6 


2/3 


Using the inequality —a 2 + ab < — \ we obtain 


V < — 


9 t R9 1 


(— )t 
TO 


2/3 


-u/0 T (9 


(8.5.59) 


Let us now consider the following two choices for w: 

(A) Fixed a For w = a, we have —a9 T 9 < — § \9\ 2 + ||0*| 2 and therefore 


V < 


e T R9 

2 


d |( 9|2 , a \ a *\2 


2> r 


(-)t 

TO 



(8.5.60) 


Because R = R T > 0, cr > 0 and ||(m)i|| 2(3 < c m for some constant c m > 0, it 

follows that V < 0 whenever V ( 9 ) > Vo for some Vo > 0 that depends on fj, a and 
\9*\ 2 . Hence, V, 9 £ Coo which from (8.5.44) implies that e,em,en s € Coo- Because 
R,Q € Coo and 9 € C Q c , we also have that 9 € Coo ■ 
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Integrating on both sides of (8.5.60), we obtain 


1 


1 _ ip 

(9 T Re + a\9\ 2 )dT < V(to)-V(t) + - I a\9*\ 2 dr 


'to 

„-0(t-s) V 2 ( S ) 


t 0 JO 


rrv 


\s) 


dsdr 


By interchanging the order of the double integration, i.e., using the identity 


t rr 


f(r)g(s)dsdT = 


’to JO 

we establish that 


f(T)g(s)dTds 


f(r)g(s)dTds 


io Jt 0 


* V 2 (s) . . 1 


to J 0 


>(s) 


dsdr < — 


'' Ldr , C 


P Jtn P 2 


(8.5.61) 


2 / a 

where c = sup ( ^(t) ’ which together with V G imply that 


R*6, Ja\9\ G S(a + rf/m?) 


It can also be shown using the inequality —ad T d < — | \8\ 2 + f |0*| 2 that JE\9\ G 
S(a + rj 2 /m 2 ). To examine the properties of e, en s we proceed as follows: We have 


4- 9 t R 9 = 2 9 T R9 + 9 T R9 
at 

= -29 t RTR 9 + 29 t RT 


-2<t9 1 RT9 - f39 1 R9 + 


'* 0(t - T ) <Kt)ti(t) 

m 2 (r) 

(0 T 0) 2 


dr 


m * 


where the second equality is obtained by using (8.5.57) with w = a and the expres- 
sion for R given by (8.5.47). Because e = t +r> , it follows that 


d _ _ _ p 

aT R9 = -29 t RTR 9 + 29 t RT 


dt 


- 0(t -r) HrHr) 
m 2 {r) 

-2 <t9 t RT9 - /39 T R9 + (, em - J-) 2 


Using (em — ^) 2 > Mr we obtain the inequality 


2 


< — 9 t R9 + 29 t RTR 9-29 t RT [ e -/3(t- T ) d T 

dt J o m z {T ) 

n 2 

+ 2a9 T RT9 + (39 1 R9 + -d— 

m z 


(8.5.62) 
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Noting that 


2\9'RT e 


-0(t-T) 

m 2 (r) 


< |Fi?r| 2 + / e -/3(t-r)MWJ 2 dr f e- 0{t ~ T)T ^\dT 

Jo m-(T) J 0 m z (t ) 

where the last inequality is obtained by using 2 ab < a 2 + b 2 and the Schwartz 
inequality, it follows from (8.5.62) that 


— < — e T Re + 2e T RTRe + 2ad T Rre + pe T R.e 


Because R^9, \/<j\9\ G S(a + rj 2 /m 2 ) and ^,9,R € C oo, it follows from the above 
inequality that e, en s ,em € S(a + rj 2 /m 2 ). Now from (8.5.57) with w = c r we have 


\9\ < ||I\R5|||/?2 0| + fT ||r|||0| 


e-^mdr 


2 ,Wr,«M V 2 (r) 


which can be used to show that 9 G 5(<r + rj 2 /m 2 ) by performing similar manipu- 
lations as in (8.5.61) and using \/cr\9\ G S (cr + g 2 / m 2 ) . 

To establish the parameter error convergence properties, we write (8.5.57) with 


w = <j as 


~9 = -TR(t)9 + T [ e- 0{t - T) ^dT~Ta9 


(8.5.63) 


In Section 4.8.4, we have shown that the homogeneous part of (8.5.63), i.e., 9 = 
— TR(t)9 is e.s. provided that </> is PE. Noting that fg ^ dr < crj for some 

constant c > 0, the rest of the proof is completed by following the same procedure 
as in the proof of Theorem 8.5.1 (iii). 

(B) Switching a For w(t) = a s we have, as shown in the proof of Theorem 8.5.2, 


cr s 9 T 9 > a s \9\(M 0 - \9*\) > 0, i.e., cr s \9\<a s 


and for \9\ = \9 + 9* \ > 2Mq we have 


M 0 - \9* 


-a s 6 T 9<-^\9\ 2 +°JJ\9*\ 2 


(8.5.64) 



600 


CHAPTER 8. ROBUST ADAPTIVE LAWS 


Furthermore, the inequality (8.5.59) with w(t) = a s can be written as 


V < 


0 t R6 

2 


1 

2 




- a s e T 0 


(8.5.65) 


Following the same approach as the one used in the proof of part (A), we can 
show that V < 0 for V > Vo and for some Vo > 0, i.e., V € Coo which implies that 
e, 6 , en s £ Coo- Integrating on both sides of (8.5.65), we obtain that Ri 9, \J a s 6 T 9 £ 
S(r] 2 /m 2 ). Proceeding as in the proof of part (A) and making use of (8.5.64), we 
show that e, en s £ S(ri 2 /m 2 ). Because 

a 2 \8\ 2 < cia s 9 T 9 


where Ci £ 1Z + depends on the bound for (Jo\9\, we can follow the same procedure 
as in part (A) to show that 8 £ S(?f/m 2 ). Hence, the proof for (i), (ii) of part (B) 
is complete. 

The proof of part (B) (iii) follows directly by setting ij = 0 and repeating the 
same steps. 

The proof of part (B) (iv) is completed by using similar arguments as in the 
case of part (A) (iii) as follows: From (8.5.57), we have 


9=WR(t)9 + T [ e ^ T ' 1 — Tw9, w = a s (8.5.66) 

Jo m ~ 

In Section 4.8.4, we have proved that the homogeneous part of (8.5.66) is e.s. if 
4> £ Coo and <f> is PE. Therefore, we can treat (8.5.66) as an exponentially stable 
linear time-varying system with inputs <j s 0, f* ^dr. Because ^ < f/ and 

G £oo, we have 


: e-M-T)tn d T 

m z 


< cr] 


for some constant c > 0. Therefore, the rest of the parameter convergence proof 
can be completed by following exactly the same steps as in the proof of part (A) 
(iii). □ 


e-Modification 

Another choice for w(t) in the adaptive law (8.5.46) is 

w(t) = \e7n\iso 

where uq > 0 is a design constant. For the adaptive law (8.5.47), however, 
the e-modification takes a different form and is given by 

W{t) = U 0 \\ e(t, • ) 777,( • ) 1 1 1 2/3 
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Theorem 8.5.5 Consider the adaptive law (8.5.46) with w = \em\vo and 
the adaptive law (8.5.47) with w(t) = vo\\e(t, -)? 77 .(-) H2/3- Both adaptive laws 
guarantee that 


(i) e,en s ,9,9 <E Coo. 

(ii) e,en s ,9 G S(n 0 + rj 2 /m 2 ). 

(iii) If n s , <f> £ £00 and </> is PE with level ao > 0 that is independent of rj, 
then 9 converges exponentially fast to the residual set 

D e = {$ \9\ < c(uq + 77 ) | 
where c > 0 and fj = sup t ^ . 

Proof The proof of (i) and (ii) for the adaptive law (8.5.46) with w(t) = v 0 \em.\ 
follows directly from that of Theorem 8.5.3 by using the Lyapunov-like function 
V = ® Tr 2 — . The time derivative V of V = ^ r 2 ^ along the solution of (8.5.47) 
with w(t) = * ) 7T?. ( * ) H 2/3 becomes 

= -e T [R.e + Q\ -wd T e 

= -e T (t) \ - ^)} d ; 

Jo m z (r) 

- \W,-)m{-)\\ 2 pi>o6 T 9 


V 
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Using e(t,r)m 2 (r) = z(r) — 8 T (t)<j>(T) = — 9 T {t)<j){r) + rj(r) we have 
rt 


V = 


< 


— e ^ T ' ) e(t, r)[e(t, T)m 2 (T) — ry(r)]dr — 

J o 

- (lk(*)-M-)|| 2 ^) + (^)t 


- ||e(VM-)ll 2 /3 


|e(t,-)m(-)|| 2/3 - ( — )t 

H 7T7 


2/3 


||e(i, -)^(-)ll 2/ 3 ^o<9 T <9 

- Mt,-)m(-)\\ 2l3 v o 0 T 0 

2/3 ' 

+ v 0 e T e\ 


where the second term in the inequality is obtained by using the Schwartz inequality. 
Using —6* 0 < — tJ_ _|_ J__J_ and the same arguments as in the proof of Theorem 8.5.3, 
we establish that V,e,en s ,0,6 € Loo- From (8.5.59) and w(t) = z/o||e(f, -)r7i( •) || 2 / 3 , 
we have 


V < 

< 


8 T R8 

2 

2 



||(e(f, •) r?z ('))t || 2 /3 9 

||(6(t,-)m(-)) t || 2 ^o(^ 


\ 6 * 


2 


) 


Integrating on both sides of the above inequality and using the boundedness of V, 
em and (8.5.61) we can establish that R^6, ^/vq ^|| (e(t, r)m(r))t|| 2/9 ^ 2 \9\ € S(v 0 + 

rj 2 /m 2 ). Following the same steps as in the proof of Theorem 8.5.3, we can conclude 
that e, en s , 9 € S(vq + ij 2 /m 2 ) and the proof of (i), (ii) is complete. 

Because em, ||e(t, -) TO (')t|| 2/3 € Loo, the proof of part (iii) can be completed by 
repeating the same arguments as in the proof of (A) (iii) and (B) (iv) of Theo- 
rem 8.5.4 for the a- modification. □ 


8.5.4 Least-Squares with Leakage 


The least-squares algorithms with leakage follow directly from Chapter 4 by 
considering the cost function 


mt) = + ,(r)e\mr)^r 


+ \e~^{9 - O o ) t Q o (0 - 0 o ) 


(8.5.68) 


where (3 > 0 is the forgetting factor, w(t) > 0 is a design weighting function 
and m{t) is as designed in Section 8.5.3. Following the same procedure as 
in the ideal case of Chapter 4, we obtain 


0 = P(e<j) — w6) 


(8.5.69) 
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P = /3P-P^P, p(o) = J P 0 = Qo 1 ( 8 - 5 . 70 ) 

where Po = Po > 0 and (8.5.70) can be modified when /3 = 0 by using 
covariance resetting, i.e., 

P = -P^P, P(t+) = P 0 = p 0 I (8.5.71) 

where t r is the time for which A m j n (P(t)) < pi for some po > Pi > 0- When 
(3 > 0, (8.5.70) may be modified as 


P = 


pp-pi^p if ||P(f)|| < R 0 
0 otherwise 


(8.5.72) 


with 1 1 Po| | < Ro for some scalar Po > 0. 

As we have shown in Chapter 4, both modifications for P guarantee that 
PP- 1 £ Coo and therefore the stability properties of (8.5.69) with (8.5.71) 
or (8.5.72) follow directly from those of the gradient algorithm (8.5.46) given 
by Theorems 8.5.4, 8.5.5. These properties can be established for the various 
choices of the leakage term w{t) by considering the Lyapunov-like function 
V (9) = ^ Tj .( — where P is given by (8.5.71) or (8.5.72) and by following the 
same procedure as in the proofs of Theorem 8.5.4 to 8.5.5. The details of 
these proofs are left as an exercise for the reader. 


8.5.5 Projection 

The two crucial techniques that we used in Sections 8.5.2 to 8.5.4 to develop 
robust adaptive laws are the dynamic normalization m and leakage. The 
normalization guarantees that the normalized modeling error term p/m is 
bounded and therefore acts as a bounded input disturbance in the adaptive 
law. Since a bounded disturbance may cause parameter drift, the leakage 
modification is used to guarantee bounded parameter estimates. Another 
effective way to guarantee bounded parameter estimates is to use projec- 
tion to constrain the parameter estimates to lie inside some known convex 
bounded set in the parameter space that contains the unknown 9*. Adaptive 
laws with projection have already been introduced and analyzed in Chapter 
4. In this section, we illustrate the use of projection for a gradient algorithm 
that is used to estimate 9* in the parametric model (8.5.42): 

z = 6* T p + r), rj = A u (s)u + A y (s)y + d\ 
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To avoid parameter drift in 8, the estimate of 9*, we constrain 9 to lie 
inside a convex bounded set that contains 9* . As an example, consider the 
set 

V = { 9\g{9 ) = 9 t 9 - M 0 2 < o} 

where Mq is chosen so that Mo > |0*|. The adaptive law for 9 is obtained 
by using the gradient projection method to minimize 

= (z - 9 T (f>) 2 
2 m 2 

subject to 9 £ V, where m is designed as in the previous sections to guarantee 
that 4>/m, r)/m £ Too- 

Following the results of Chapter 4, we obtain 
f Te<f> if 9 T 9 < Ml 

8 = < or if 9 t 9 = Mg and (Te0) T 0 < 0 (8.5.73) 

1 (-^ ~ qt pg )Te0 otherwise 

where 0(0) is chosen so that 0 T (O)0(O) < Mg and e = ^ , T = T t > 0. 

The stability properties of (8.5.73) for estimating 9* in (8.5.42) in the 
presence of the modeling error term r) are given by the following Theorem. 

Theorem 8.5.6 The gradient algorithm with projection described by the 
equation (8.5.73) and designed for the parametric model (8.5.42) guaran- 
tees that 

(i) e,en s ,9,9 <E Coo 

(ii) e,en s ,9 £ S{p 2 /m 2 ) 

(iii) If r] = 0 then e, en s , 9 £ C 2 

(iv) Ifn s ,4> £ Too an( i ^ is PE with level «o > 0 that is independent of g, 
then 

(a) 9 converges exponentially to the residual set 

T> p ={0 \6\ < c(/ 0 + fj)} 

where fj = sup t c > 0 is a constant and /o > 0 is a design 
constant. 
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(b) There exists an if > 0 such that if g < rf , then 6 converges 
exponentially fast to the residual set 


D p = 


1 9 \9\ < cr/| 


for some constant c > 0. 


Proof As established in Chapter 4, the projection guarantees that \9(t) \< Mq, Vt > 0 
provided |0(O)| < Mq. Let us choose the Lyapunov-like function 


V = 


(Fr- 1 # 

2 


Along the trajectory of (8.5.73) we have 


( —e 2 m 2 + eg if 9 T 9 < Mf 

V = i or if 9 t 9 = Mq and (Te(f>) T 9 < 0 

[ —e 2 m 2 + eg — ^rj^0 T re(/> if 0 T 9 = Mq and (Te0) T 0 > 0 

For 9 t 9 = Mq and (re^) T 0 = 9 T Tecj) > 0, we have sgn j gT r ^ e< ^ | = sgn{0 T 0}. 
For 9 r 9 = Mq, we have 

9 t 0 = 9 T 9 - 9* t 9 > Mq - \9*\\9\ = M 0 (M 0 - |6»*|) > 0 


where the last inequality is obtained using the assumption that Mq > |0*|. There- 
fore, it follows that ^ gr r g e ^ > 0 when 9 r 9 = Mq and {Tef) 1 9 = 9 T Tecj) > 0. 
Hence, the term due to projection can only make V more negative and, therefore, 


V = —e 2 rn 2 + eg < 


2 2 
era 


2 



m 2 


Because V is bounded due to 9 € Coo which is guaranteed by the projection, it 
follows that em € S{g 2 /m 2 ) which implies that e, en s G S(g 2 /m 2 ). From 9 G Coo 

and (f>/m, g/m G Coo, we have e, en s G Coo- Now for 9 T 9 = Mq we have < c 

for some constant c > 0 which implies that 


\9\ < c\eej>\ < c|em| 


Hence, 9 G S(g 2 /m 2 ) and the proof of (i) and (ii) is complete. The proof of part 
(iii) follows by setting g = 0, and it has already been established in Chapter 4. 

The proof for parameter error convergence is completed as follows: Define the 
function 


f if if 9 T 9 = Mq and ( Vef) T 9 > 0 

1 0 otherwise 


(8.5.74) 
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It is clear from the analysis above that f(t) > 0 Vt > 0. Then, (8.5.73) may be 
written as 

6 = Te(j)-TfO. (8.5.75) 

We can establish that fd T d has very similar properties as cr s 9 T 9 , i.e. , 

m \e\>f9 T e>f\9\(M 0 -\9*\), /> 0 

and |/(t)| < /o Vf > 0 for some constant /o > 0. Therefore the proof of (iv) (a), 
(b) can be completed by following exactly the same procedure as in the proof of 
Theorem 8.5.4 (B) illustrated by equations (8.5.48) to (8.5.56). □ 

Similar results may be obtained for the SPR-Lyapunov type adaptive 
laws and least-squares by using projection to constrain the estimated pa- 
rameters to remain inside a bounded convex set, as shown in Chapter 4. 


8.5.6 Dead Zone 

Let us consider the normalized estimation error 

z — 9 T (j) — 0 T (f> + 

m 2 rri 2 

for the parametric model 


(8.5.76) 


z = 9* T (j) + rj, = A u (s)u + A y (s)y + di 

The signal e is used to “drive” the adaptive law in the case of the gradient 
and least-squares algorithms. It is a measure of the parameter error 8, which 
is present in the signal 9 T (j), and of the modeling error rj. When rj = 0 and 
9 = 0 we have e = 0 and no adaptation takes place. Because ^ £ £oo, 
large em implies that — - is large which in turn implies that 9 is large. 
In this case, the effect of the modeling error rj is small and the parameter 
estimates driven by e move in a direction which reduces 9. When em is small, 
however, the effect of rj may be more dominant than that of the signal 9 T (j) 
and the parameter estimates may be driven in a direction dictated mostly 
by rj. The principal idea behind the dead zone is to monitor the size of the 
estimation error and adapt only when the estimation error is large relative 
to the modeling error rj, as shown below: 
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f(e) = 

e + g ^ 

— 90 
m 



Oil 

/ ~ 

m 


(a) 



Figure 8.6 Normalized dead zone functions: (a) discontinuous; (b) con- 
tinuous. 


We first consider the gradient algorithm for the linear parametric model 
(8.5.42). We consider the same cost function as in the ideal case, i.e., 




2 

e m 


2 


2 


and write 



if \em\ > g 0 > ^ 
otherwise 


(8.5.77) 


In other words we move in the direction of the steepest descent only when the 
estimation error is large relative to the modeling error, i.e., when |em| > go, 
and switch adaptation off when era is small, i.e., |e?n| < go- In view of 
(8.5.77) we have 


6 = T(j)(e + g), g = 


0 if |em| > go 
— e if |em| < go 


(8.5.78) 


To avoid any implementation problems which may arise due to the disconti- 
nuity in (8.5.78), the dead zone function is made continuous as follows: 


9 = T(j){e + g), g = 


d if em < -go 
if era > go 
— e if |era| < go 


(8.5.79) 


The continuous and discontinuous dead zone functions are shown in Fig- 
ure 8.6(a, b). Because the size of the dead zone depends on m, this dead 
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zone function is often referred to as the variable or relative dead zone. Sim- 
ilarly the least-squares algorithm with the dead zone becomes 

6 = P<t)(e + g) (8.5.80) 


where e,g are as defined in (8.5.79) and P is given by either (8.5.71) or 
(8.5.72). 

The dead zone modification can also be incorporated in the integral adap- 
tive law (8.5.47). The principal idea behind the dead zone remains the same 
as before, i.e. , shut off adaptation when the normalized estimation error is 
small relative to the modeling error. However, for the integral adaptive law, 
the shut-off process is no longer based on a pointwise-in-time comparison of 
the normalized estimation error and the a priori bound on the normalized 
modeling error. Instead, the decision to shut off adaptation is based on the 
comparison of the £26 norms of certain signals as shown below: 

We consider the same integral cost 

j(M) = - T H£m£I dT 

2 Jo mHr 


as in the ideal case and write 


-TVJ(60 if || e(t, -MO || 2/9 > 9o > sup t \\(^)thi3 
0 otherwise 


(8.5.81) 


where f3 > 0 is the forgetting factor, v > 0 is a small design constant, 

^ A z(t) - 0 T (t)</>(r) 

^(^5 W Of ^ 

m z [r) 

and ||e(i, -)m(-)|| 2/3 = [Jq e~^^ r ' ) e 2 (t, r)m 2 (r)(ir] 1 / 2 is implemented as 
||e(t, -)m (-) \\ 20 = (r 0 + 20 T Q + 0 T R0) 1 / 2 


r 0 = -/3r 0 + ^, r 0 (0) = 0 

where Q, R are defined in the integral adaptive law given by (8.5.47) 
In view of (8.5.81) we have 


(8.5.82) 


6 = — T(R6 + Q-g ) 

R = -flR + ^r, R( 0) = 0 (8.5.83) 

m z 

Q = -PQ-% Q(o) = 0 



8.5. ROBUST ADAPTIVE LAWS 


609 


where 

f 0 if || e(t, •)?«(') hp > 9o 

| ( R9 + Q) otherwise 

To avoid any implementation problems which may arise due to the discon- 
tinuity in g, the dead zone function is made continuous as follows: 

f 0 if ||e(t, -)w.(-)l|2/3 > 2c/ 0 

9= \ (R0 + Q)( 2 -MlM^) iig 0< \\e(t,-)m(-)\\ 2/3 <2g 0 (8.5.84) 

{ {RO + Q) if \\e(t,-)m(-)\\ 2 p < go 

The following theorem summarizes the stability properties of the adaptive 
laws developed above. 


Theorem 8.5.7 The adaptive laws (8.5.79) and (8.5.80) with P given by 
(8.5.71) or (8.5.72) and the integral adaptive law (8.5.83) with g given by 
(8.5.84) guarantee the following properties: 

(i) e,en s ,6,9 e Coo. 

(ii) e,en s ,9 <E S(g 0 + if/m 2 ). 

(hi) 0 <E C 2 C\Ci. 

(iv) lirn^oo 9(t) = 9 where 9 is a constant vector. 

(v) Ifn s ,(j) e Too a nd 4> is PE with level «o > 0 independent of rj, then 9(t) 
converges exponentially to the residual set 


D d = {0eU n \\9\ <c(g 0 + fj)} 


where g = sup t 


m(t) 


and c> 0 is a constant. 


Proof Adaptive Law (8.5.79) We consider the function 

Vfl = 


whose time derivative V along the solution of (8.5.79) where cm 2 = — 9 T cf> + g is 
given by 

V = 9 t <j>(e + g) = — (em 2 — g)(e + g) (8.5.85) 


Now 


(em + g 0 ) 2 - ( g 0 + ff )(em + g 0 ) > 0 if em < -g 0 


(em -g)(e+g) = < (em - g 0 ) 2 + (go - „)( eTO ~ 5 o) > 0 if em > g 0 (8.5.86) 


0 


if |em| < g 0 
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Hence, {em 2 — r/){e + g) > 0,Vi > 0 and V < 0, which implies that V, 9 £ £ oo and 
sj {an 2 — 77) (e d~ y) £ £2 ■ Furthermore, 9 £ £ ^ implies that 9, e, en s £ £00 • From 
(8.5.79) we have 

0 T n>, 

,2 


9 t 9 = 


m 


However, 


which, together with (8.5.86), implies that 

n * — ( r 1 „\ 2™2 ^ /,™2 


{e + g) 2 m 2 


(8.5.87) 

{em + g 0 ) 2 

if em < -go 


{em - g 0 ) 2 

if em > go 


0 

if em < go 


+ ff), i-e.,{ 

e + g)m £ £ 2 


that 9 £ £ 2 . 

Equation (8.5.85) 

can also 


be written as 


V” < — e 2 m 2 + lem 


M 


\rj\ 

\em\g 0 + —g 0 


m m 

by using \g\ < — . Then by completing the squares we have 


V < — 


M 2 


< -- 


2 2 
e m 


m 


3|j?| , O 2 


2m 2 


9 0 


which, together with V € £00 ? implies that em € 5(j§ + ^1). Because to 2 = 1 + n 2 
we have e,en s £ S(gy + {= 2 ). Because \g\ < ^ < cipo> we can show that |0| < 
(\em\ + g 0 ), which implies that 9 £ S{g% + ^U) due to em £ S{g% + Because g 0 

is a constant, we can absorb it in one of the constants in the definition of m.s.s. and 
• 2 

write em ,9 £ S {go + ^s) to preserve compatibility with the other modifications. 

To show that lim^^ 9 = 9, we use (8.5.85) and the fact that {em 2 —r]){e+g) > 0 
to obtain 


V = —{em 2 — rj){e + 3) = — 

\v\ 


V 

em 

m 


■g\i 


< 


< 


em — 


m 


9o 


M 


e + g\m 

e + g\m (because \e + g\ = 0 if \em\ < go) 


Because I! £ £00 and g 0 > — we integrate both sides of the above inequality and 
use the fact that V £ £oo to obtain that (e + g)m £ £\. Then from the adaptive 



8.5. ROBUST ADAPTIVE LAWS 


611 


law (8.5.79) and the fact that ^ £ £«,, it follows that \6\ £ £i, which, in turn, 
implies that the lim^^ f* 9dT exists and, therefore, 9 converges to 9. 

To show that 9 converges exponentially to the residual set Dd for persistently 
exciting 0, we follow the same steps as in the case of the fixed cr-modification or 
the e^-modification. We express (8.5.79) in terms of the parameter error 

§=-r^-0 + r^ + r<f>g (8.5.88) 

m. m z 

where g satisfies \g\ < — < C\go for some constant c\ > 0. Since the homogeneous 
part of (8.5.88) is e.s. when <j> is PE, a property that is established in Section 4.8.3, 
the exponential convergence of 9 to the residual set Dd can be established by re- 
peating the same steps as in the proof of Theorem 8.5.4 (iii). 

Adaptive Law (8.5.80) The proof for the adaptive law (8.5.80) is very 
similar to that of (8.5.79) presented above and is omitted. 

Adaptive Law (8.5.83) with g Given by (8.5.84) This adaptive law may 
be rewritten as 

9 = a e T f r)^(r)dr (8.5.89) 

J o 

where 

f 1 if ||e(f,-)rn(-)||2/3 > 2g 0 

a e= ) "dt-)m (-) \\ 20 _ 1 if go < || e (t,.)m (-)|| 2/ 3 < 2g 0 (8.5.90) 

l 0 if ||e(t,-)rn(-)||2/3 < go 

Once again we consider the positive definite function V (9) = — - whose time 

derivative along the solution of (8.5.83) is given by 

V = a e 9 r (t) f e - ,3 0- r ) e (£ ; r)^(r)dr 

Jo 

Using 0 T (t)(/i(r) = — e(f, t)to 2 (t) + ry(r), we obtain 

V = — a e [ e _/3 ^ _T )e(f, r)m( T ) e(t, r)rn(r) — dr (8.5.91) 

Jo i m T )_ 

Therefore, by using the Schwartz inequality, we have 

V < -a e \\e(t,-)m{-)\\ 2 g [iKVMOIh/? - ||( — )thp 

L TO j 

From the definition of a e and the fact that g 0 > sup t ||(^)t|| 2/ j + u i ^ follows 

that V < 0 which implies that V, 9, e £ C x and Hindoo = Vx exists. Also 9 £ Lx 
implies that en s £ C x - Using go > sup t ||(^)t|| 2 ^ + v i n (8.5.91), we obtain 

V < —va e \\e(t, ■) to (-)|| 2/3 
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Now integrating on both sides of the above inequality and using V £ Coo, we have 


<T e ||e(f,-)m(-)l|2/3 G A 


(8.5.92) 


Using (8.5.89) and the boundedness property of — , we can establish that \6\ € A . 

Moreover, from (8.5.83) with g given by (8.5.84) we also have that 9 is uniformly 

continuous which together with 9 £ A imply that lim^^ 9{t) = 0. Furthermore, 

using the same arguments as in the proof for the adaptive law (8.5.79), we can 

conclude from \9\ £ A that Hindoo 9 = 9 for some constant vector 9. 

2 

It remains to show that e, en s £ S{go + ^ 2 )- This can be done as follows: 
Instead of (8.5.91), we use the following expression for V : 


V = 9 r (t) 


3(1 r) 


e r 'r ' 'e(t, r)</>(r)dr 


bo 


9 T g 


= -9 T R9 + d T (t) / e 


-0(t-r) 

mHr) 


dr + 9' g 


obtained by using (8.5.83) instead of (8.5.89), which has the same form as equation 
(8.5.58) in the proof of Theorem 8.5.4. Hence, by following the same calculations 
that led to (8.5.59), we obtain 


V < 


9 T R9 
2 



(8.5.93) 


Using the definition of e(t, r), we obtain 


\R9 + Q| — 


< 


r e -0 (t-r) <I>(t){6 T - z(t)) 

Jo m2 


o-P(t-T) 


<j>(T)e(t,T)dT 


f ‘ „-3(t-r) \Ht)\ 2 


2 (t) 


dr 


\e(t,-)m{-) t \\ 2 0 


where the last inequality is obtained by using the Schwartz inequality. Because 
— £ Coo, we have 

m 005 

\R9 + Q\ < c 2 \\e(t , -)"i(-)t|| 2/3 (8.5.94) 

for some constant c 2 > 0. Using the definition of g given in (8.5.84) and (8.5.94), 
we have 

Ifll < c 9o 

for some constant c > 0 and therefore it follows from (8.5.93) that 


V < 


9 T R9 

2 



+ cg 0 


2 
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where c > 0 depends on the bound for \9\. 

The above inequality has the same form as (8.5.59) and we can duplicate the 
steps illustrated by equations (8.5.59) to (8.5.63) in the proof of Theorem 8.5.4 (A) 
to first show that R?9 G <S(g 0 + rj 2 /m 2 ) and then that e, en s , 9 G S(go + rj 2 /m 2 ). 
The proof for part (v) proceeds as follows: We write 

9 = —TR9 + T [ e -W-r)^ dr + r (8.5.95) 

Jo m 

Because the equilibrium 9 e = 0 of the homogeneous part of (8.5.95), i.e., 9 = — TR9 , 
is e.s. for persistently exciting a property that has been established in Section 
4.8.4, we can show as in the proof of Theorem 8.5.4 A(iii) that 

|6*| < /3oe _/32 * + Pi f e _/32 ^ _T ' l |c?(r)|dT 

Jo 

where d = f* e~^9~ T ) ^dr + g, po,(Ji ar e nonnegative constants and (3 2 > 0. 
Because ^ G £oo and ^ < fj, we have \d\ < cifj + \g\. Because \g\ < ego, we have 

1^1 < c(fj + g 0 ) + e t 

for some constant c > 0, where et is an exponentially decaying to zero term which 
completes the proof of part (v) . □ 


8.5.7 Bilinear Parametric Model 


In this section, we consider the bilinear parametric model 

z = W ( s)p * (0* T ^ + zo) + v 
?? = A U (s)u + A y (s)y + di 


(8.5.96) 


where z,ip,zo are measurable signals, W(s) is proper and stable, p*,9* 
are the unknown parameters to be estimated on-line and ?? is the model- 
ing error term due to a bounded disturbance d\ and unmodeled dynamics 
A u (s)u, A y (s)y. The perturbations A u (s),A y (s) are assumed without loss 
of generality to be strictly proper and analytic in Re[s] > —Sq/2 for some 
known do > 0. 

The techniques of the previous sections and the procedure of Chapter 4 
may be used to develop robust adaptive laws for (8.5.96) in a straightforward 


manner. 
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As an example, we illustrate the extension of the gradient algorithm to 
the model (8.5.96). We express (8.5.96) as 

z = p* (9* T (j) + zi) + 77 


where 4 > = W(s)^, z\ = W{s)zq. Then the gradient algorithm (based on the 
instantaneous cost) is given by 


9 = re0sgn(p*) — -tuiT# 

P = 7 e £ ~ W 2 IP 

z — z z — p(9 T (j) + z\) 
m 2 m 2 

£ = 9 t 4> + z\ 


(8.5.97) 


where 9,p is the estimate of 9 * , p* at time t; m > 0 is chosen so that 
m’m’m e wi,W 2 are leakage modifications and T = T t > 0,7 > 0 
are the adaptive gains. Using Lemma 3.3.2, the normalizing signal m may 
be chosen as 

m 2 = 1 + 4> T 4> + z\ + n 2 , n 2 = m s 
n i s = -5 0 m s + u 2 + y 2 , m s ( 0) = 0 

If A u , A y are biproper, then m 2 may be modified to include u, y, i.e., 
m 2 = 1 + + z\ + n 2 + u 2 + y 2 

If 4> = 7L(s)[u, y] T , = /i(s)[it, y] T , where H(s), h(s ) are strictly proper and 
analytic in Re[s] > —5q/2, then the terms (j) T (j),z 2 may be dropped from the 
expression of m 2 . 

The leakage terms w\ , rc 2 may be chosen as in Section 8.5.2 to 8.5.4. For 
example, if we use the switching cr-modification we have 

w\ = ai s , W 2 = (J 2 s (8.5.98) 

where 

f 0 if \xi\ < Mi 

&is = S ffo(W - 1) if Mi < \xi\ < 2 Mi 

(do if \xi\ > 2 Mi 

with 7 = 1,2 and \x\\ = | 6 *|, \x 2 \ = \p\, and ao > 0 is a design constant. The 
properties of (8.5.97) and (8.5.98) can be established in a similar manner as 
in Section 8.5.3 and are summarized below. 
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Theorem 8.5.8 The gradient algorithm (8.5.97) and (8.5.98) for the bilin- 
ear parametric model (8.5.96) guarantees the following properties. 

(i) e, en s ,p, 9, p,9 € Coo 

(ii) e,en s ,p,9 G S(p 2 /m 2 ) 


The proof of Theorem 8.5.8 is very similar to those for the linear parametric 
model, and can be completed by exploring the properties of the Lyapunov 
function V = + ^. 

As we discussed in Chapter 4, another way of handling the bilinear case 
is to express (8.5.96) in the linear form as follows 

z = W(s)(9*' T (f) + i 1 (8.5.99) 


where 6* = [9\,9^J] T ,9\ = p* , 6) = p*6* and = [zq, V ;T ] T - Then using the 
procedure of Sections 8.5.2 to 8.5.4, we can develop a wide class of robust 
adaptive laws for estimating 9* . From the estimate 9 of 9 * , we calculate the 
estimate 9, p of 9* and p* respectively as follows: 

p = h, 9 =°J- (8.5.100) 

VI 

where 9\ is the estimate of 9\ = p* and 02 is the estimate of 9 | = p*9*. 
In order to avoid the possibility of division by zero in (8.5.100), we use the 
gradient projection method to constrain the estimate 9 1 to be in the set 

C = [9 e n n+1 | g(9) =po- 9 lS gnp* < 0 } (8.5.101) 

where po > 0 is a lower bound for 1/3*1. We illustrate this method for the 
gradient algorithm developed for the parametric model z = 9* T <f> + 77 , 4> = 
W{s)ff = [zi,^> T ] T given by 


e 

e 


Te<j> - wT9 
z — z z — 9 T (f> 


(8.5.102) 


where to > 0 is designed so that ^ G £00 • We now apply the gradient 
projection method in order to constrain 9(t) G C Vf > to, i.e., instead of 
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(8.5.102) we use 

(T{ecj) — w0) if 9 e Co 

# = J or if 9 e 5(C) and [r(e<^> — w0)] T Vg < 0 

[ (i - r VgTrvg ) r ( e< ^ - w9) Otherwise. 

(8.5.103) 

where Co, 5(C) denote the interior and boundary of C respectively. 

Because Vg= [— sgn(p*), 0, . . . , 0] T , (8.5.103) can be simplified by choos- 
ing T = diag( 70, To) where 70 > 0 and To = rj > 0, i.e., 

! 7 0 ezi - ^qwOi if 0isgn (p*) > p 0 

or if 0isgn(p*) = po and (ez\ — w6i)sgn(p*) > 0 
0 otherwise 

9-2 = r 0 (e^> - iud 2 ) 

(8.5.104) 

The adaptive law (8.5.104) guarantees the same properties as the adap- 
tive law (8.5.97), (8.5.98) described by Theorem 8.5.8. 


8.5.8 Hybrid Adaptive Laws 

The adaptive laws developed in Sections 8.5.2 to 8.5.7 update the estimate 
9(t) of the unknown parameter vector 9* continuously with time, i.e., at each 
time t we have a new estimate. For computational and robustness reasons, 
it may be desirable to update the estimates only at specific instants of time 
tk where {tk} is an unbounded monotonically increasing sequence in 1Z + . 
Let tk = kT s where T s = tk + 1 — tk is the “sampling” period and k € A f + 
and consider the design of an adaptive law that generates the estimate of 
the unknown 9* at the discrete instances of time t = 0, T s , 2 T s , .... 

We can develop such an adaptive law for the gradient algorithms of Sec- 
tion 8.5.3. For example, let us consider the adaptive law 


0 = Te<j) — wT9 

z — z z — 9 T cj) 
m 2 m 2 

where z is the output of the linear parametric model 

z = 9* T <t> + 77 


(8.5.105) 
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and rn is designed so that ^ G Hoc- Integrating (8.5.105) from t k = kT s 
to tk+\ = (k + 1)T S we have 

Ak+i 

6 k+1 = 0 k + T / (e(r)^(r) - w(r)0 k )dT (8.5.106) 

■>t h 

where = 0(ffc). Equation (8.5.106) generates a sequence of estimates, i.e., 
do = 0(0), 0! = 0(T S ),0 2 = 0(2 T s ),...,0 k = 0(kT s ) of 0*. Although e, 0 
may vary with time continuously, 0(t) = 0 k = constant for t G [t k ,t k+ 1 ). In 

(8.5.106) the estimate z of z and e are generated by using 0 k , i.e., 

z(t) = 0k<t>(t), e = t e [t k ,t k+ 1) (8.5.107) 

We shall refer to (8.5.106) and (8.5.107) as the robust hybrid adaptive law. 

The leakage term w(t) chosen as in Section 8.5.3 has to satisfy some 
additional conditions for the hybrid adaptive law to guarantee similar prop- 
erties as its continuous counterpart. These conditions arise from analysis 
and depend on the specific choice for w(t). We present these conditions and 
properties of (8.5.106) for the switching cr-modification 

I 0 if 1^1 < M ° (a c ins'; 

u,(t) = <r„ = | if |<y > Mo (8.5.108) 

where <7o > 0 and Mq > 2\0*\. Because of the discrete-time nature of 

(8.5.106) , cr .5 can now be discontinuous as given by (8.5.108). 

The following theorem establishes the stability properties of (8.5.106), 
(8.5.108): 


Theorem 8.5.9 Let m, ao,T s ,T be chosen so that 

(a) ie£ 0 o,^<l 

(b) 2T s \ m < l,2a 0 X m T s < 1 


where X m = A maa; (r). Then the hybrid adaptive law (8.5.106), (8.5.108) 
guarantees that 


(i) e, en s € L OO? @k ^ ^OO 
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(ii) e, era £ <S(^), A0 k £ where A 6 k = 6 k+1 - 8 k and 

pk 0 +NT S 


A 


ko+N 

Y X k X k < c 0 

k=ko 


t'kr. 


V(y) = |{x fc } 
for some cq, c\ £ IZ + and any ko, N £ A f + 


y(r)dT + ci 


(iii) If n s ,(j> £ £00 end (f> is PE with a level of excitation «o > 0 that is 
independent ofr), then 


(a) 6 k = 6 k — 8* converges exponentially to the residual set 


Pn = 


{ & \0\ < c(o-o + ??)} 


where c > 0 , 77 = sup t \ ^\. 

(b) There exists a constant fj* > 0 such that if fj < fj*, then 8 k 
converges exponentially fast to the residual set 


Vg = 1 0 \8\ < cry | 


Proof Consider the function 


v{k) = el v-^ k 


(8.5.109) 


Using 9 k + 1 = 9 k + A9 k in (8.5.109), where A 9 k = T f** +1 (e(r)^(r) — w(r)9 k )dT, we 
can write 


V(k + 1) = V{k) + 29lT- 1 A0 k + A9lr- 1 A0 k 

ptk + 1 

= V(k) + 29f / 


' tk 


(e(r)^(r) - w(r)9 k )dT 

tk + 1 
t k 


(8.5.110) 


ptk + 1 rtk + 1 

+ (e(r)^(r)-«;(T)6» fc ) T cirr / (e(r)</>(r)-w(T)0 fc )dT 

«/ 1 /, 


Because 9jf> = —em 2 + 77 , we have 


ptk- 1-1 + l 

9j / e(r)</>(r)dT = / (— e 2 m 2 + e? 7 )dr < — 


f tfc +i e 2 m 2 




dr - 


tfc 


'ifc 


2 m 2 


dr 


(8.5.111) 
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c\ 2 i 2 

where the last inequality is obtained by using the inequality — a z + ab < — + \. 

Now consider the last term in (8.5.110), since 


rtk+i rtk+i 

/ (e(r)^(r) - w(T)9 k ) T dTV / (e(r)^(r) - w(T)9 k )dT 

J tk J tk 


^ Am 


f‘ik + 1 


(e(r)^(r) - w(r)9 k )dT 


't k 


where X m = A maa; (r 1 ), it follows from the inequality (a + b) 2 < 2a 2 + 2 b 2 that 

rtk+i rtk+i 

/ (e(r)^(r) - w(T)9 k ) T dTV / (e(r)^(r) - w(T)9 k )dT 

J tk J tk 

Jt k m(T) 


2X 


’tk 

rtk+i rtk+ 1 I J 12 

2A m / e 2 (r)?n 2 (r)fir / -^-dr + 2A m cr 2 T 2 |6» fc | 

Jtk Jtk m 


2 

rtk+i 

+ 2A m 

/ w{r)9 k dT 

Jt k 


tk+i U|2 


< 


2^2 | |2 




< 2X m T, 


l (T)m 2 (r)dT + 2A m cr 2 T 2 |6» fc | 2 


(8.5.112) 




In obtaining (8.5.112), we have used the Schwartz inequality and the assumption 
— < 1. Using (8.5.111), (8.5.112) in (8.5.110), we have 


Ufc+i Ufc+i | ?1 |2 

U(fc+1) < V (k) — (1 — 2X m T s ) / e 2 (r)m 2 (r)dr+ / ^ 

Jt k Jt k ™ 

-2 a s T s 9j9 k + 2X m a 2 T 2 \9 k \ 2 

rtk + 1 

< V(k) - (1 - 2X m T s ) / e 2 (r)m 2 (r)dr + T s 

Jt k 

— 2<j s T s (9 k 9 k — X m a 0 T s \9 k \ 2 ) 

rtk+i 

< V{k) - (1 - 2X m T s ) / e 2 (r)m 2 (r)dr + fj 2 T s 

Jtk 


dr 


-2a s T s ( (* - A m cr 0 T s )|6» fc | 2 - ® 


(8.5.113) 


where the last inequality is obtained by using 9j9 k > Therefore, for 

1 — 2<JoX m T s > 0 and 1 — 2X m T s > 0, it follows from (8.5.113) that V(/c + l) < V(k) 
whenever 

Thus, we can conclude that V(k) and 9 k € 1^. The boundedness of e, em follows 

immediately from the definition of e and the normalizing properties of m, i.e., 

A 2L e £ 

m ’ m 00 ’ 
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To establish (ii), we use the inequality 

°s9jo k = y ele k + (^e T k e k -a s e T k e*) 

> yl^| 2 + yl^l(l^|- 2|fl*|) 

Because a s = 0 for \9 k \ < Mo and cr s > 0 for \9k\ > M 0 > 2\9*\, we have cr s \9 k \{\9 k \ — 
2\9*\) > 0 Vfc > 0, therefore, 

V S 0k9 k > y |6> fc | 2 

which together with 2aoX m T s < 1 imply that 

a s (9j9 k - X m a 0 T s \9 k \ 2 ) > c a a s \9 k \ 2 (8.5.114) 

where c CT = \ — X m a 0 T s . From (8.5.114) and the second inequality of (8.5.113), we 
have 

rtfo . |_i 2 

(1 — 2X m T s ) / e 2 (r)?n 2 (r)dr + Co-o-^l^l 2 < ^(fc) - y(/c + 1) + / -^dr 

7t fc m 

(8.5.115) 

2 2 

which implies that cm G iS(^) and G (^). Because |e| < |ern| (because 

m > l,Vf > 0 ), we have e € 

Note from (8.5.112) that A 9 k satisfies 

ko~\-N ko~\~N ptk+i ko~\-N 

(A9 k ) T A9 k < 2X m T s Y J / e 2 (r)m 2 (r)dr + 2A m T 2 ^ a 2 s \9 k \ 2 

k—ko k—ko k—ko 

rtk 0 +N ko+N 

< 2A m T s / e 2 (r)?77. 2 (r)fir+2A m T 2 ^ a 2 \6 k \ 2 (8.5.116) 

Jtk o fc=fe 0 

2 2 

Using the properties that em G S(A I ) 1 a s 9 k G X>(^ 2 ), we have 


fco+Af 

E 

k—kg 


(A9 k ) T A9 k < ci + C 2 




'tfcn 


?n z 


for some constant ci,c 2 > 0. Thus, we conclude that Ad fc G X>(-(U). 

Following the same arguments we used in Sections 8.5.2 to 8.5.6 to prove pa- 
rameter convergence, we can establish (iii) as follows: We have 


9 k + 1 = o k - r 


'tk 


K (j>(r)4> T (t) 
m 2 (r) 


dr9 k + r 


ftk+i <^( t )? 7 ( t ) 
t k m2 ( r ) 


dr - Ta s 9 k T s 
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which we express in the form 


0k + 1 = A(k)9 k + Bv{k) 


(8.5.117) 


where A(k) = I -T jJ +1 H ^ r \ r) dT, B = T, u(k ) = / t ^ +1 

We can establish parameter error convergence using the e.s. property of the homo- 
geneous part of (8.5.117) when <j> is PE. In Chapter 4, we have shown that (f> being 
PE implies that the equilibrium 9 e = 0 of 9 k + 1 = A(k)9 k is e.s., i.e., the solution 9 k 
of (8.5.117) satisfies 


1 4|</?o7 fc + /?iE^“>* 


(8.5.118) 


for some 0 < 7 < 1 and /3q, Pi > 0. Since \vi\ < c(g + ctq) for some constant c > 0, 
the proof of (iii) (a) follows from (8.5.118). From the definition of u k , we have 


Wk\ < C 0 r] + d\a s 9 k \ < C 0 r] + c 1 \a s 9 k \ \9 k \ 


(8.5.119) 


for some constants Co,Ci,c l5 where the second inequality is obtained by using 
cr s \9k\ < < m 0 -\8*\ \ a s9k\ \0k\- Using (8.5.119) in (8.5.118), we have 


\0k\ < Pol k + PiV + P 2 Y, 7 fc “i <rs0i\\9i 


(8.5.120) 


where p[ = f ^,/32 = PiCi- 

To proceed with the parameter convergence analysis, we need the following 
discrete version of the B-G Lemma. 


Lemma 8.5.1 Let x(k), f(k), g(k) be real valued sequences defined for k = 
0 , 1 , 2 ,...,, and f(k), g(k), x(k) > 0 Vfc. If x{k) satisfies 

k - 1 

x(k) < f(k ) + g{i)x(i), k = 0 , 1 , 2 , . . . 

i=0 

then, 

k— 1 k — 1 

x(k)<f(k) + Y^ Ylil + gU)) 9{i)f{i), k = 0,1,2,... 

2—0 j=i-\-l 

where Ilj=i+i(l + dU)) se t equal to 1 when i = k — 1 . 
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The proof of Lemma 8.5.1 can be found in [42]. 

Let us continue the proof of Theorem 8.5.7 by using Lemma 8.5.1 to obtain 
from (8.5.120) that 


fc-i ( fc-i I 

\@k\ < /?o7 fc + /3iV + n (1 + /3 2 cr s |^|) > f3 2 \ cr s 0i\(Po + Pirn 1 ) (8.5.121) 

i= o [ j-i * 1 J 

Using the inequality J][” = i Xi < (ET=i x i/ n ) n which holds for any integer n and 
positive Xi, we have 


fc-i 




k—i—1 


E--£/3 2 ^-l 


j=i + 1 


k — i — 1 


= 1 + 


k—i— 1 


k — i — 1 


Because a s \9 k \ < it follows from (8.5.115) that 


(8.5.122) 


c s |0 fe | < ciV(fc) - V(k + 1) + [ £^dr 

l dt fe m 2 


for some constant c, which implies that 

k - 1 


£ /32CT s |0j| < Ci77 2 (fc - * - 1) + Co 

j=i + 1 

for some constant ci,Cq > 0. Therefore, it follows from (8.5.122) that 


fc-i 


(1 + fl2<Xs\0j\) < (l + 

j=i+l 


ciry 2 (fc — i — 1) + c 0 


k — i — 1 


k—i—1 


< ( 1 + ciry 2 + 


Co 


k — i — 1 


k—i—1 


(8.5.123) 


Because for x > 0, the inequality (1 + ^ ) n < e x holds for any integer n > 0, it 
follows from (8.5.123) that 


fc-i 


k—i—1 — c ° 

1 c 1 r ] 2 


]^[ (1 + /?2cr s |^j|) < (l + Cl?? 2 ) e 1+ 

j—i+ 1 


(8.5.124) 


Using (8.5.124) in (8.5.121), we have 


k - 1 


\e k \ < Pol k + p'lfj + £7 fe (l + c 1 tf) k -'- 1 mm»2<r s mf3o + fa 7“') (8.5.125) 


2=0 
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Because lb ay | <9,; G and 7 < 1, the inequality (8.5.125) leads to 

k - 1 k - 1 

\0k\ < + P 1 V + p 3 (Vi) k ^2{Vi( l Fcif)} k 1 1 + + c -if)] k ~ l 

i=0 2=0 

for some constants /?3, /?4 >0. If fj is chosen to satisfy v / y(l + c\fj 2 ) < 1, i.e., for 
fj* 2 = > 0 and fj 2 < fj* 2 , we obtain 

\h\ < 6 o7 fe + PiV + P' 3 (Vl) k + PVI 

for some constants (3 3 ,/3 4 > 0. Therefore, 9k converges exponentially to a residual 
set whose size is proportional to fj. □ 


8.5.9 Effect of Initial Conditions 

The effect of initial conditions of plants such as the one described by the 
transfer function form in (8.5.1) is the presence of an additive exponentially 
decaying to zero term 770 in the plant parametric model as indicated by 
equation (8.5.8). The term 770 is given by 

Cj = A c cj, u(0) = wo 
% = Cj CJ 

where A c is a stable matrix and u 0 contains the initial conditions of the 
overall plant. As in the ideal case presented in Section 4.3.7, the term ?/o 
can be taken into account by modifying the Lyapunov-like functions V(6) 
used in the previous sections to 

V m (6) = V(0)Pi. v t P 0 lu 

where Po = Po > 0 satisfies the Lyapunov equation PoA c + A J Pq = —70 1 
for some 70 > 0 to be chosen. As in Section 4.3.7, it can be shown that the 
properties of the robust adaptive laws presented in the previous sections are 
not affected by the initial conditions. 


8.6 Summary of Robust Adaptive Laws 

A robust adaptive law is constructed in a straight forward manner by mod- 
ifying the adaptive laws listed in Tables 4.1 to 4.5 with the two crucial 
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modifications studied in the previous sections. These are: the normalizing 
signal which is now chosen to bound the modeling error term r/ from above 
in addition to bounding the signal vector cp, and the leakage or dead zone or 
projection that changes the “pure” integral action of the adaptive law. 

We have illustrated the procedure for modifying the adaptive laws of 
Tables 4.1 to 4.5 for the parametric model 

z = W(s)[0* T ip + W~ 1 (s)ij\, rj = A u (s)u + A y (s)y + d\ (8.6.1) 
which we can also rewrite as 


z = 9* T cj) + rj 


( 8 . 6 . 2 ) 


where (p = W(s)ip. Without loss of generality, let us assume the following: 

51. W(s) is a known proper transfer function and W(s), 1T -I (s) are ana- 
lytic in Re[s] > — do/2 for some known do > 0. 

52. The perturbations A u (s),A y (s) are strictly proper and analytic in 
Re[s] > — do/2 and do > 0 is known. 

53. ip = Hq(s)[u, y] T where Hq(s) is proper and analytic in Re[s] > —do/2. 


For the LTI plant models considered in this book, assumptions SI and S3 
are satisfied by choosing the various design polynomials appropriately. The 
strict properness of A u , A y in assumption S2 follows from that of the overall 
plant transfer function. 

Let us first discuss the choice of the normalizing signal for parametric 
model (8.6.1) to be used with the SPR-Lyapunov design approach. We 
rewrite (8.6.1) as 

z f = W (s) L(s)[6* T cp f + rj f ] (8.6.3) 


with 


z f = 


ho 


h 0 


s + h 0 Z ’ s + h 0 


L 1 (s)ip 


Vf = L A (s) 


- 1 1 


ho 


s + ho 


(A u (s)u + A y (s)y + di) 


by filtering each side of (8.6.1) with the filter . We note that L(s) is 
chosen so that WL is strictly proper and SPR. In addition, L~ 1 (s), L(s) are 
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analytic in Refs] > —So/2 and ho > So/2. Using Lemma 3.3.2, we have from 
assumptions SI to S3 that the normalizing signal m generated by 

2 i i 2 2 

m = 1 + n s , n s = m s 

rh s = -Som s + u 2 + y 2 , m s { 0) = 0 (8.6.4) 

bounds both (frfiVf from above. The same normalizing signal bounds rj, </> in 
the parametric model (8.6.2). 

In Tables 8.1 to 8.4, we summarize several robust adaptive laws based 
on parametric models (8.6.1) and (8.6.2), which are developed by modifying 
the adaptive laws of Chapter 4. Additional robust adaptive laws based on 
least-squares and the integral adaptive law can be developed by following 
exactly the same procedure. 

For the bilinear parametric model 

z = p*(9* T <j) + zi) + rj, p = A u (s)u + A y (s)y + d (8.6.5) 

the procedure is the same. Tables 8.5 to 8.7 present the robust adaptive laws 
with leakage, projection and dead zone for the parametric model (8.6.5) when 
the sign of p* is known. 


8.7 Problems 

8.1 Consider the following system 

200 

y = e (s- P )( s + m) u 

where 0 < r <1 and p « 1. Choose the dominant part of the plant and 
express the unmodeled part as a (i) multiplicative; (ii) additive; and (iii) 
stable factor perturbation. 

Design an output feedback that regulates y to zero when r = 0.02. 

8.2 Express the system in Problem 8.1 in terms of the general singular perturbation 

model presented in Section 8.2.2. 

8.3 Establish the stability properties of the switching-cr modification for the exam- 

ple given in Section 8.4.1 when 0 < Mq < |0*|. 
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Table 8.1 Robust adaptive law based on the SPR-Lyapunov 

method 


Parametric 

model 

z = W ( s)9* t 4> + 77, 77 = A U (s)u + A y (s)y + d 

4> = Hq(s)[u, y\ T 

Filtered 

parametric 

model 

z f = W{s)L(s){e^4>f+ri f )A f = 1 ^ i W{s)L-\s)cl> 
z f= s +Lur)f = W 1 (s)L 

Adaptive law 

6 = Te<j>- TwO 
e = z — z — W (s)L(s)en 2 

Normalizing 

signal 

m 2 = 1 + nj, nj = m s 

m s = -5 0 m s + u 2 + y 2 , m s { 0 ) = 0 

Leakage 

(a) Fixed a 

(b) Switching a 

(c) e-modification 

w = a 

r 0 if \e\ < m 0 

w = Gs = | (ft - X ) a o if M 0 < \e\ < 2M 0 
{ a 0 if \9\ > 2M 0 

w = |em|^o 

Assumptions 

(i) A U ,A y strictly proper and analytic in Refs] > 

— 5q/2 for some known Jo > 0; (ii) W(s), Ho(s) are 
known and proper and W(s), VF _ 1 (s), Hq(s) are an- 
alytic in Refs] > —Jo/2 

Design 

variables 

T = T 1 > 0 ; (j 0 > 0 ; vq > 0 ; ho > <5o/2; L _ 1 (s ) 
is proper and L(s), L~ 1 (s) are analytic in Refs] > 
—Jo/2; W(s)L(s) is proper and SPR 

Properties 

(i) e, em, 6, 6 G (ii) e,em,6 G S (/ 0 + r/ 2 / m 2 ) , 

where /o = er for w = cr, /o = 0 for tc = cr s , and 
/o = ^0 for re = |em |^ 0 
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Table 8.2 Robust adaptive law with leakage based on the 

gradient method 


Parametric model 

z = 9* T cj) + 77, ij = A u (s)u + A y (s)y + d 

Adaptive law 

Q = Tej)- Tw6 
z — 0 T (j> 
m 2 

Normalizing 

signal 

As in Table 8.1 

Leakage w 

As in Table 8.1 

Assumptions 

(i) A u ,A y strictly proper and analytic in 
Refs] > —5q/2 for some known do > 0; (ii) 0 = 
H(s)[u, y\ T , where H(s ) is strictly proper and an- 
alytic in Refs] > — 5q/2. 

Design 

variables 

r = r T > 0; The constants in w are as in Table 
8.1. 

Properties 

(i) e, era, 9, 9 £ (ii) e,em,9 £ S(if/m 2 + / 0 ), 

where /o is as defined in Table 8.1. 


8.4 Consider the following singular perturbation model 


x = Aux + A 12 Z + b\u 
HZ = A 22 Z + b 2 u 

where x £ 1 Z n ,z £ 7Z m ,y £ 7Z 1 and A 2 2 is a stable matrix. The scalar 
parameter /i is a small positive constant, i.e., 0 </j< 1. 

(a) Obtain an nth-order approximation of the above system. 

(b) Use the transformation 77 = z + A 22 b 2 U to transform the system into one 

with states x, 77. 

(c) Show that if u = —Kx stabilizes the reduced-order system, then there 

exists a h* > 0 such that u = — Kx also stabilizes the full-order system 
for any y £ [0, fi*). 
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Table 8.3 Robust adaptive law with projection based on the 

gradient method 


Parametric 

model 

z = 0* T (f> + 77, 77 = A u (s)u + A y (s)y + d 

Adaptive law 

(Tec/) if \9\ < M 0 

Q = J or if \9 \ = Mq and (rec/>) T 0 < 0 

y (/ — Te(f> otherwise 

z — 0 T cj) 
m, 2 

Normalizing 

signal 

As in Table 8.1 

Assumptions 

As in Table 8.1 

Design 

variables 

|0(o)| <M o; m 0 > | 0 *| ; r = r T >0 

Properties 

(i) e,em,0,Q e (ii) e,em,9 € S{^ 2 /m 2 ) 


8.5 Establish the stability properties of the shifted leakage modification —w(9 — 0*) 

for the three choices of w and example given in Section 8.4.1. 

8.6 Repeat the results of Section 8.4.1 when u is piecewise continuous but not 

necessarily bounded. 

8.7 Repeat the results of Section 8.4.2 when u is piecewise continuous but not 

necessarily bounded. 

8.8 Repeat the results of Section 8.4.3 when u is piecewise continuous but not 

necessarily bounded. 

8.9 Simulate the adaptive control scheme given by equation (8.3.16) in Section 8.3.3 

for the plant given by (8.3.13) when b = 2. Demonstrate the effect of large 
7 and large constant reference input r as well as the effect of switching-off 
adaptation by setting 7 = 0 when instability is detected. 
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Table 8.4 Robust adaptive law with dead zone based on the 

gradient Method 


Parametric model 

z = 0* T 4> + rj, r) = A u (s)u + A y (s)y + d 

Adaptive law 

8 = F0(e + g) 

( ^ if ™ < ~9o 

9 = \ m if em > 90 

{ -e if \em\ < g 0 
z — 9 T cj) 
m 2 

Normalizing signal 

As in Table 8.1 

Assumptions 

As in Table 8.1 

Design variables 

g 0 > M ; r = r T > o 

Properties 

(i) e, em, 8,8 6 (ii) e, em., 9 £ S{y 2 /rn 2 + g 0 )-, 

(iii) Hindoo 8(t) = 8 


8.10 Consider the closed-loop adaptive control scheme of Section 8.3.2, i.e., 

x = ax + z — u 
gz = —2 + 2 u 
V = x 

u = —kx, k = 'yx 2 

Show that there exists a region of attraction V whose size is of O(A-) for 
some a > 0 such that for a:(0), 2 ( 0 ), fc(0) £ V we have x(t),z(t) — > 0 and 
k(t) — > fc(oo) as t — » oo. (Hint: use the transformation rj = z — 2 u and 
choose V = \ ^ + g where k* > a.) 
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Table 8.5 Robust adaptive law with leakage for the bilinear 

model 


Parametric 

model 

^ = p*(0* T (j) + zi) + ry, ?y = A U (s)u + A y (s)y + d 

Adaptive law 

6 = Te4>sgn(p*) — wiT8 

P = 7^ - W21P 

e= , £ = 9 T 4> + z i 

m z 

Normalizing 

signal 

As in Table 8.1. 

Leakage r w t 
i = 1,2 

As in Table 8.1 

Assumptions 

(i) A u ,A y are as in Table 8.1; (ii) 4> = H(s)[u, y\ T , 
zi = hi(s)u + h, 2 (s)y, where H(s), hi(s), /i 2 (s) are 
strictly proper and analytic in Refs] > — 5o/2 

Design 

variables 

T = r T > 0 , 7 > 0 ; the constants in Wi are as defined 
in Table 8.1 

Proporties 

(i) e,em,p,6,p,9 G C ^ (ii) e,em,p,9 G S(p 2 /m 2 + 
fo), where /o is as defined in Table 8.1 


8.11 Perform simulations to compare the properties of the various choices of leakage 
given in Section 8.4.1 using an example of your choice. 

8.12 Consider the system 

y = 9*u + r\ 
r i = A (s)u 

where y,u are available for measurement, 9* is the unknown constant to 
be estimated and ry is a modeling error signal with A(s) being proper and 
analytic in Refs] > —0.5. The input u is piecewise continuous. Design an 
adaptive law with a switching-er to estimate 9* . 




8.7. PROBLEMS 


631 


Table 8.6 Robust adaptive law with projection for the bilinear 

model 


Parametric 

model 

Same as in Table 8.5 

Adaptive 

law 

r Titfa if \0i\< Mi 

0 . = J or if \ 6 i\ = Mi and (Pje^) T i9j < 0 

| (i gT r ^ otherwise 

i = 1,2 with 

e 1 = 9 , 0 2 =p,(j)i=(j)sgn(p*),(j )2 = ^,ri = r,r 2 = 7 

e=^, a = 0 T t + z 1 

Assumptions 

As in Table 8.5 

Normalizing 

signal 

As in Table 8.1 

Design 

variables 

|0(O)| < Ml, |p(0)| < M 2 ; r = r T > 0,7 > 0 

Properties 

(i) e, era, p, 0,p,6 G Too; (h) e, era, p,0 € S{j] 2 /m 2 ) 


8.13. The linearized dynamics of a throttle angle 9 to vehicle speed V subsystem 
are given by the 3rd order system 

T/ _ bp!P2 

V — 7 77 77 7 0 + d 

(s + a)(s + Pi)(s + P2) 

where pi,P 2 > 20, 1 > a > 0 and d is a load disturbance. 

(a) Obtain a parametric model for the parameters of the dominant part of 

the system. 

(b) Design a robust adaptive law for estimating on-line these parameters. 

(c) Simulate your estimation scheme when a = 0.1, b = l,pi = 50, P 2 = 100 

and d = 0.02 sin 5f. 
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Table 8.7 Robust adaptive law with dead zone for the bilinear 

model 


Parametric 

model 

Same as in Table 8.5 

Adaptive law 

9 = T<j>(e + g)sgn(p*) 

P = 7 £(e + s) 

f ^ if em < -g 0 

9 = \ if em > 9o 

1 — e if em| < go 

e=^, H = 9 T <t> + z 1 

Assumptions 

As in Table 8.5 

Normalizing 

signal 

Same as in Table 8.1 

Design 

variables 

go > M; r = r T > 0 , 7 > 0 

Properties 

(i) e, em, p, 9, p, 9 G (ii) e, em, p, 

9 G S(r] 2 /m 2 + go); (hi) Hindoo 9(t) = 0 


8.14 Consider the parameter error differential equation 

6 > = —^u 2 0 + jdu 


that arises in the estimation problem of Section 8.3.1 in the presence of a 
bounded disturbance d. 

(a) Show that for d = 0 and u = — 1 — r , the equilibrium 9 e = 0 is u.s and 

(l + t) 2 

a.s but not u.a.s. Verify that for 



d(t) = (1 + t) « 


2 ( 1—0 ^ 
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u = (1 + t) 2 and 7 = 1 we have y — > 0 as t — > oo and 0(f) — > oo as 
t — > oo. 

(b) Repeat the same stability analysis for u = uo where ito 7^ 0 is a constant, 
and show that for d = 0, the equilibrium 9 e = 0 is u.a.s. Verify that 0(f) 
is bounded for any bounded d and obtain an upper bound for |0(f)|. 

8.15 Repeat Problem 8.12 for an adaptive law with (i) dead zone; (ii) projection. 

8.16 Consider the dynamic uncertainty 

V = A u (s)u + A y (s)y 

where A u ,A y are proper transfer functions analytic in Re[s] > — for some 
known 6q > 0. 


(a) Design a normalizing signal m that guarantees 77/m € Coo when 

(i) A u , A y are biproper. 

(ii) A U ,A y are strictly proper. 

In each case specify the upper bound for 

(b) Calculate the bound for |??|/m when 


(i) A u (s) = 

(ii) A u (s) = 


~ TS - 1 s 2 

7T^' a » (s) = , ‘pny 


ps _ /is 

fis + 2’ v[S> (/rs+1) 2 


where 0<(iCl and 0 < r < 1. 


8.17 Consider the system 

e~ TS b 

^ (s + o)(/Lts + l) 

where 0<r<Cl,0</.i<Cl and a,b are unknown constants. Obtain a 
parametric model for 6* = [b, a] T by assuming r « 0,/x « 0. Show the effect 
of the neglected dynamics on the parametric model. 



Chapter 9 

Robust Adaptive Control 
Schemes 

9.1 Introduction 

As we have shown in Chapter 8, the adaptive control schemes of Chapters 4 
to 7 may go unstable in the presence of small disturbances or unmodeled 
dynamics. Because such modeling error effects will exist in any implementa- 
tion, the nonrobust behavior of the schemes of Chapters 4 to 7 limits their 
applicability. 

The purpose of this chapter is to redesign the adaptive schemes of the 
previous chapters and establish their robustness properties with respect to a 
wide class of bounded disturbances and unmodeled dynamics that are likely 
to be present in most practical applications. 

We start with the parameter identifiers and adaptive observers of Chap- 
ter 5 and show that their robustness properties can be guaranteed by de- 
signing the plant input to be dominantly rich. A dominantly rich input is 
sufficiently rich for the simplified plant model, but it maintains its richness 
outside the high frequency range of the unmodeled dynamics. Furthermore, 
its amplitude is higher than the level of any bounded disturbance that may 
be present in the plant. As we will show in Section 9.2, a dominantly rich 
input guarantees exponential convergence of the estimated parameter errors 
to residual sets whose size is of the order of the modeling error. 

While the robustness of the parameter identifiers and adaptive observers 
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of Chapter 5 can be established by simply redesigning the plant input with- 
out having to modify the adaptive laws, this is not the case with the adaptive 
control schemes of Chapters 6 and 7 where the plant input is no longer a 
design variable. For the adaptive control schemes presented in Chapters 6 
and 7, robustness is established by simply replacing the adaptive laws with 
robust adaptive laws developed in Chapter 8. 

In Section 9.3, we use the robust adaptive laws of Chapter 8 to mod- 
ify the MRAC schemes of Chapter 6 and establish their robustness with 
respect to bounded disturbances and unmodeled dynamics. In the case of 
the MRAC schemes with unnornralized adaptive laws, semiglobal bounded- 
ness results are established. The use of a dynamic normalizing signal in the 
case of MRAC with normalized adaptive laws enables us to establish global 
boundedness results and mean-square tracking error bounds. These bounds 
are further improved by modifying the MRAC schemes using an additional 
feedback term in the control input. By choosing a certain scalar design pa- 
rameter r in the control law, the modified MRAC schemes are shown to 
guarantee arbitrarily small £oo bounds for the steady state tracking error 
despite the presence of input disturbances. In the presence of unmodeled 
dynamics, the choice of r is limited by the trade-off between nominal per- 
formance and robust stability. 

The robustification of the APPC schemes of Chapter 7 is presented in 
Section 9.5. It is achieved by replacing the adaptive laws used in Chapter 7 
with the robust ones developed in Chapter 8. 

9.2 Robust Identifiers and Adaptive Observers 

The parameter identifiers and adaptive observers of Chapter 5 are designed 
for the SISO plant model 

y = G 0 (s)u (9.2.1) 

where Go(s) is strictly proper with stable poles and of known order n. In 
this section we apply the schemes of Chapter 5 to a more realistic plant 
model described by 

V = Gq(s)(1 + A m (s))(u + d u ) (9.2.2) 

where G(s) = Go(s)(l + A m (s)) is strictly proper of unknown degree; A m (s) 
is an unknown multiplicative perturbation with stable poles and d u is a 
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bounded disturbance. Our objective is to estimate the coefficients of Go(s) 
and the states of a minimal or nonmininral state representation that cor- 
responds to Go(s), despite the presence of A m (s), d u . This problem is, 
therefore, similar to the one we would face in an actual application, i.e., 
(9.2.1) represents the plant model on which our adaptive observer design is 
based and (9.2.2) the plant to which the observer will be applied. 

Most of the effects of A m (s),d u on the robustness and performance of 
the schemes presented in Chapter 5 that are designed based on (9.2.1) may 
be illustrated and understood using the following simple examples. 

Example 9.2.1 Effect of bounded disturbance. Let us consider the simple 
plant model 

y = Q*u + d 

where d is an external bounded disturbance, i.e., \d(t)\ < do,Vf > 0,it G £oo and 9* 
is the unknown constant to be identified. The adaptive law based on the parametric 
model with d = 0 is 

9 = 7eiit, ei = y — 9u (9.2.3) 

which for d = 0 guarantees that ei, 9 G and e 1? 9 G £ 2 - If, in addition, u is PE, 
then 9(t) — » 9* as t — > oo exponentially fast. When d 0, the error equation that 
describes the stability properties of (9.2.3) is 

9 = —7 u 2 9 + 7 ud (9.2.4) 

where 9 = 9 — 9*. As shown in Section 8.3, if u is PE, then the homogeneous part 
of (9.2.4) is exponentially stable, and, therefore, the bounded input "fud implies 
bounded 9. When u is not PE, the homogeneous part of (9.2.4) is only u.s. and 
therefore a bounded input does not guarantee bounded 9. In fact, as shown in 
Section 8.3, we can easily find an input u that is not PE, and a bounded disturbance 
d that will cause 9 to drift to infinity. One way to counteract parameter drift and 
establish boundedness for 9 is to modify (9.2.3) using the techniques of Chapter 8. 
In this section, we are concerned with the parameter identification of stable plants 
which, for accurate parameter estimates, requires u to be PE independent of whether 
we have disturbances or not. Because the persistent excitation of u guarantees 
exponential convergence, we can establish robustness without modifying (9.2.3). 
Let us, therefore, proceed with the analysis of (9.2.4) by assuming that u is PE 
with some level ao > 0, i.e., u satisfies 

t+T 

u 2 dr > aoT, Vt > 0, for some T > 0 
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Then from (9.2.4), we obtain 

|0(t)| < yfcie- 7a ^|0(O)| + ^ (1 - e-^) sup |u(r)d(r)| 

a 0 r<t 

for some constants ki,a 0 > 0, where a 0 depends on a o- Therefore, we have 

lim sup|6*(r)| < — lim sup |u(r)d(r)| = — sup |u(r)d(r) | (9.2.5) 

t-^oo r >£ Qig t^OO Og r 

The bound (9.2.5) indicates that the parameter identification error at steady state 
is of the order of the disturbance, i.e., as d — > 0 the parameter error also reduces 
to zero. For this simple example, it is clear that if we choose u = ug, where ug is 
a constant different from zero, then a 0 = Ug,fci = 1; therefore, the bound for \9\ 
is sup t \d(t)\/uQ. Thus the larger the uq is, the smaller the parameter error. Large 
Mo relative to \d\ implies large signal-to-noise ratio and therefore better accuracy of 
identification. 

Example 9.2.2 (Unmodeled Dynamics) Let us consider the plant 

V = 0*(1 + A m (s))u 

where A m (s) is a proper perturbation with stable poles, and use the adaptive law 
(9.2.3) that is designed for A m (s) = 0 to identify 0* in the presence of A m (s). The 
parameter error equation in this case is given by 

9 = — 7u 2 (9 + jur), ri = 9*A m (s)u (9.2.6) 

Because u is bounded and A m (s) is stable, it follows that rj £ and therefore 
the effect of A m (s) is to introduce the bounded disturbance term rj in the adaptive 
law. Hence, if u is PE with level «o > 0, we have, as in the previous example, that 

f. 

lim sup|0(r)| < — sup \u(t)r](t)\ 

t.^oo T > t a 0 t 

The question now is how to choose u so that the above bound for \9\ is as small 
as possible. The answer to this question is not as straightforward as in Example 
9.2.1 because ij is also a function of u. The bound for \9\ depends on the choice 
of u and the properties of A m (s). For example, for constant u = uq yf 0, we have 
a 0 = Uq, ki = 1 and rj = A m (s)u 0 , i.e., lim^oc, \r](t)\ = |A m (0)||u o |, and, therefore, 

lim sup |(9(r)| < |A m (0)| 

t^oo r > t 

If the plant is modeled properly, A m (s) represents a perturbation that is small (for 
s = jaj) in the low-frequency range, which is usually the range of interest. Therefore, 
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for u = uq, we should have |A m (0)| small if not zero leading to the above bound, 
which is independent of uq. Another choice of a PE input is u = cos u>ot for some 
ujq ^ 0. For this choice of u, because 


o — 7 J g cos 2 wo Tdr o — 


= e 




"o ' = e 


’e V < e V 


(where we used the inequality t+ sm ^°* > 0,Vt > 0) and sup t \r](t)\ < |A m (jw 0 )|, 
we have 

lim sup \0{t)\ < 4|A m (jw 0 )| 

t^oo t > 4 


This bound indicates that for small parameter error, wo should be chosen so that 
|A m (jwo)| is as small as possible. If A m (s) is due to high frequency unmodeled 
dynamics, |A m (ju>o)| is small provided ojq is a low frequency. As an example, 
consider 


A m (s) 


/is 

1 + fis 


where fi > 0 is a small constant. It is clear that for low frequencies \A m (joj) \ = O(n) 
and \A m (juj)\ — > 1 as u> — > oo. Because 


|A m (jw 0 )| 


\/l + /i 2 wg 


it follows that for to o = we have A m (ju>o)| = whereas for u>o < j; we have 
\A rn (joJo) = O(n). Therefore for accurate parameter identification, the input signal 
should be chosen to be PE but the PE property should be achieved with frequencies 
that do not excite the unmodeled dynamics. For the above example of A m (s), 
u = Uq does not excite A m (s) at all, i.e. , A m (0) = 0, whereas for u = sin LUot with 
wo ^ the excitation of A m (s) is small leading to an 0(/x) steady state error for 

\0\- V 


In the following section we define the class of excitation signals that 
guarantee PE but with frequencies outside the range of the high frequency 
unmodeled dynamics. 

9.2.1 Dominantly Rich Signals 

Let us consider the following plant: 


y = G 0 (s)u + A a (s)u 


(9.2.7) 
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where G'o(s), A a (s) are proper and stable, A a (s) is an additive perturbation 
of the modeled part Go(s). We like to identify the coefficients of Gq(s) by 
exciting the plant with the input u and processing the I/O data. 

Because A a (s)u is treated as a disturbance, the input u should be chosen 
so that at each frequency uii contained in u, we have |Go(jA*) | 3> \A a (jcoi)\. 
Furthermore, u should be rich enough to excite the modeled part of the 
plant that corresponds to Gq(s) so that y contains sufficient information 
about the coefficients of G'o(s). For such a choice of u to be possible, the 
spectrums of Gq(s) and A a (s) should be separated, or |G'o(jA) | 2> |A a (jcj)| 
at all frequencies. If Go(s) is chosen properly, then |A a (jcj)| should be small 
relative to \Go(ju) | in the frequency range of interest. Because we are usually 
interested in the system response at low frequencies, we would assume that 
\Go(juj)\ 3> \A a (jijj)\ in the low-frequency range for our analysis. But at high 
frequencies, we may have \Go(ju) | being of the same order or smaller than 
|A a (jA)|. The input signal u should therefore be designed to be sufficiently 
rich for the modeled part of the plant but its richness should be achieved 
in the low-frequency range for which |G'o(jA) | 3> \A a (juj)\. An input signal 
with these two properties is called dominantly rich [90] because it excites 
the modeled or dominant part of the plant much more than the unmodeled 
one. 


Example 9.2.3 Consider the plant 


b 1 — 2ys 

V = u 

s + a 1 + jj,s 


(9.2.8) 


where y > 0 is a small constant and a > 0, b are unknown constants. Because y is 
small, we choose Gq(s) = as the dominant part and rewrite the plant as 


V = 


s + a 


1 - 


3 ys 


where 


1 + ys 

A a {ys,s) = - 


s + a 


u + A a (ys, s)u 


3bys 


(1 + ys)(s + a) 

is treated as an additive unmodeled perturbation. Because 


|G 0 (jA 0 )| 2 = 


b 2 

uJq + a 2 


\A a {jyuJoJuo )\ 2 = 


% 2 y 2 ujQ 
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it follows that |Go(ja>o)| \A a (jfiu)o, jo>o)| provided that 

« 75 

Therefore, the input u = sinu^t qualifies as a dominantly rich input of order 2 for 
the plant (9.2.8) provided 0 < wo < 0(1/ fi) and is small. If coq = I, then 

l<?o(jw 0 )| 2 = 2 2 i i = ^ 0 i2 ) 

[i-a z + 1 

and 

07,2 2 

\A a (j^ 0 ,jco 0 )\ 2 = ^ > |G 0 (jw 0 )| 2 

Hence, for u > o > 0(1), the input u = sin(o>ot) is not dominantly rich because it 
excites the unmodeled dynamics and leads to a small signal to noise ratio. 

Let us now examine the effect of the dominantly rich input u = sin(u>ot), with 
0 < w 0 < 0(1/ n), on the performance of an identifier designed to estimate the 
parameters a,b of the plant (9.2.8). Equation (9.2.8) can be expressed as 

2 = r T <^ + ? ? (9.2.9) 


where 


z = —^y, 9 = 

s + A 


1 1 

v 


-iT 


s T A s T A 


, e* = [b,a ] n 


A > 0 is a design parameter and 


V = 


3b/j,s 

(s + A)(l + [is ) 


is the modeling error term, which is bounded because u £ C and A,/x > 0. The 
gradient algorithm for estimating 9* when r] = 0 is given by 


6 = Tecj> 

€ = z — Z, Z = 6 T (j) 


(9.2.10) 


The error equation that describes the stability properties of (9.2.10) when applied 
to (9.2.9) with 77 ^ 0 is given by 

9 = -TcjxtAe + Tfo! (9.2.11) 


Because </> £ it follows from the results of Chapter 4 that if (f) is PE with level 
ao > 0, then the homogeneous part of (9.2.11) is exponentially stable which implies 


that 


lim sup |6*(r)| < sup \<j)(t)r](t)\ 
t ~> 00 T >t Po t 


(9.2.12) 
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where c = ||r|| and (3q depends on T, ou in a way that as «o — ► 0, /3q — * 0 (see proof 
of Theorem 4.3.2 in Section 4.8). Therefore, for a small error bound, the input u 
should be chosen to guarantee that <j> is PE with level «o > 0 as high as possible 
(without increasing sup 4 \<j>(t)\) and with sup t \r](t)\ as small as possible. 

To see how the unmodeled dynamics affect the dominant richness properties of 
the excitation signal, let us consider the input signal u = sinojot where u>o = 0(1). 
As discussed earlier, this signal u is not dominantly rich because it excites the high- 
frequency unmodeled dynamics. The loss of the dominant richness property in this 
case can also be seen by exploring the level of excitation of the signal <f>. If we 
assume zero initial conditions, then from the definition of <f>, we can express <f> as 


Ai(cj o) sin(w 0 f - 
A 2 (lu q) sin(w 0 f - 


■ <Pi) 

■ <P2) 


where 


Ai(wo) = 


A 2 (u>o) — 


HvT+V^ 


x/wq + A 2 + Wo + A 2 )(w| + a2 )( 1 + ++) 

and p 1 , p 2 also depend on ojq. Thus, we can calculate 




4>(t)(J) T (t)cIt = — 

U)q 


A\ A 1 A 2 cos(t/?i - p 2 ) 

A 1 A 2 cos((^i - p 2 ) 


Al 


(9.2.13) 


For any t > 0 and T > , it follows from (9.2.13) that 


1 

T 


rt+T 


(/)(t)<P t (t)cLt = 


r n T - 1 ,t+(*+l))£ 


T 


E 

2=0 


dr - 


lt+i — 


rt+T 


/ t-\-riT 


dr 


< 1 / n T TT | 7 r ^ ' A\ A 1 A 2 cos(p 1 - p 2 ) 

~ T\ ujq woy. AiA 2 cos(p\ — p 2 ) 


Al 


where nr is the largest integer that satisfies nr < ++■ By noting that < \ 
and < 2 , it follows that 


1 

T 


rt+T 

J (f>(T)<j) T (t)(1t < 


< a 


A\ AxA 2 cos(p! - p 2 ) 

A 1 A 2 cos(p! - p 2 ) 

Af 0 
0 Al 


All 


for any constant a > 2. Taking w 0 = ^ for some constant c > 0 (i.e., u 0 = O+) ), 
we have 


Ai 


9 

x/c 2 + A 2 /x 2 


A 2 


|6|x/l + 4c 2 + 

+J+++++++++L+R )(r+4)2y 
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which implies that 

t+T 

<^(r)0 T (r)dr < 0(y 2 )I (9.2.14) 

for all y £ [0, y*] where y* > 0 is any finite constant. That is, the level of PE of f> is 
of 0{y 2 ). As /i — > 0 and ui o — > oo, we have if // +T (t)cIt — > 0 and the level 

of PE vanishes to zero. Consequently, the upper bound for linp^^ sup r>t |0 (t)| 
given by (9.2.12) will approach infinity which indicates that a quality estimate for 
9* cannot be guaranteed in general if the nondominantly rich signal u = sinuot 
with wo = O(-) is used to excite the plant. 

On the other hand if u = sinu>ot is dominantly rich, i.e., 0 < |w'o < O(-), then 
we can show that the level of PE of <t> is strictly greater than O(y) and the bound 
for sup t |r?(t)| is of O(fi) (see Problem 9.1). V 



Example 9.2.3 indicates that if the reference input signal is dominantly 
rich, i.e., it is rich for the dominant part of the plant but with frequencies 
away from the parasitic range, then the signal vector cf in the adaptive law 
is PE with a high level of excitation relative to the modeling error. This 
high level of excitation guarantees that the PE property of <f{t) cannot be 
destroyed by the unmodeled dynamics. Furthermore, it is sufficient for the 
parameter error to converge exponentially fast (in the case of the gradient 
algorithm (9.2.10)) to a small residual set. 

Let us now consider the more general stable plant 

y = G 0 (s)u + yA a (ys,s)u + d (9.2.15) 

where Go(s) corresponds to the dominant part of the plant and yA a (ys,s) 
is an additive perturbation with the property lim^o /uA a (/xs, s) = 0 for 
any given s and d is a bounded disturbance. The variable y > 0 is the 
small singular perturbation parameter that can be used as a measure of the 
separation of the spectrums of the dominant dynamics and the unmodeled 
high frequency ones. 

Definition 9.2.1 (Dominant Richness) A stationary signal u : TZ + 1Z 

is called dominantly rich of order n for the plant (9.2.15) if (i) it consists of 
distinct frequencies u) 2 , ■ ■ ■ ,ujn where N > (ii) satisfies 


coi\ < O 


1 

9 


0Ji- w>k\> O(y), for i k; i, k = 1 , . . . , N 
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and (in) the amplitude of the spectral line at cut defined as 

f u (vi) = lim ^ f u(r)e~ juJiT dT Vi > 0 
7 1 — >oo 1 Jt 

satisfies 

\fu(wi)\>0(n) + 0(d), i = 1,2, . . . , N 

For example, the input u = A\sinuj\t + A 2 sinw 2 t is dominantly rich of 
order 4 for the plant (9.2.15) with a second order Go(s) and d = 0 provided 
\ui - u 2 \ > 0(h); |wi|, \u 2 \ < O(i) and |Ai|, \A 2 \ > 0(h). 


9.2.2 Robust Parameter Identifiers 

Let us consider the plant 

y = Go(s)(l + H^m(Hs, s))u = Go(s)u + //A a (//s, s)u 

where 


(9.2.16) 


G 0 (s) = 


b m S rn + bm-iS 171 + • • • + bo _ Zp(s ) 


,A a (Hs,s) = G 0 (s)A m (Hs,s) 


s n +a n -\s n 1 H hao R P (s) 

and G(fis) is the transfer function of the modeled or dominant part of the 
plant, A m (ns,s) is a multiplicative perturbation that is due to unmodeled 
high frequency dynamics as well as other small perturbations; h > 0 and 
lirn^o /uA a (/xs, s) = 0 for any given s. The overall transfer function is 
assumed to be strictly proper with stable poles. Our objective is to identify 
the coefficients of Gq(s) despite the presence of the modeling error term 
HA a (ns,s)u. 

As in the ideal case, (9.2.16) can be expressed in the linear parametric 
form 

(9.2.17) 


z = 0* T 4> + HV 


where 0* = [6 m , . 

. . ,b 0 . 

) tin— 1 ) • • • 

,a 0 ] T , 

s n 

A\ — 

«m( s ) 

1 

Z ~ 

cp — 

Ms) ’ 

A(s 


-|T 


-y 


n = 


z p(s) 

A(s) 


A m (HS,s)u 


cti(s) = [s l , s* 1 , . . . , 1] T and A(s) is a Hurwitz polynomial of degree n. As 
shown in Chapter 8, the adaptive law 


= Tefi, 


e = z — z, z = 


(9.2.18) 
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developed using the gradient method to identify 9* on-line may not guarantee 
the boundedness of 9 for a general class of bounded inputs u. If, however, 
we choose u to be dominantly rich of order n + m + 1 for the plant (9.2.16), 
then the following theorem establishes that the parameter error 9{t ) — 9* 
converges exponentially fast to a residual set whose size is of 0(/i). 

Theorem 9.2.1 Assume that the nominal transfer function Gq(s) has no 
zero-pole cancellation. If u is dominantly rich of order n + m + 1 , then there 
exists a n* > 0 such that for p £ [0,/a*), the adaptive law (9.2.18) or any 
stable adaptive law from Tables 4-2, 4-3> an d 4-5, with the exception of the 
pure least-squares algorithm, based on the parametric model (9.2.17) with 
r] = 0 guarantees that e, 9 £ Coo and 9 converges exponentially fast to the 
residual set 

IZ e = {9 | \9-9*\ < cp) 
where c is a constant independent of fi. 


To prove Theorem 9.2.1 we need to use the following Lemma that gives 
conditions that guarantee the PE property of <f for p ^ 0. We express the 
signal vector cj) as 

4> = Ho(s)u + fiHi^pLs, s)u (9.2.19) 

where 


H 0 (s) 
H 1 (/j,s, s) 


1 


A(s) 

1 


A(-) 


«m(s),-«n-l('S)G' 0 (s)] 

0, . . . , 0, -a^_ 1 (s)G 0 (s)A rn (ij,s, s) 


Lemma 9.2.1 Let u : IZ + R be stationary and Hq(s), Hi(/j,s,s) satisfy 
the following assumptions: 


(a) The vectors Ho(juji), Ho(ju 2 ), ■■■, Ho(jLOn) are linearly independent 
on C n for all possible uj\ , u) 2 , ... , uJn £ H, where n = n + m + 1 and 
Vi / v k for i / k. 


(b) For any set {v\,U 2 , ■ ■ ■ ,ws} satisfying \ uii — 0Jk\ > 0(/a) for i / k and 
\ui\ < we have \ det(^)| > 0(/T) where H = [Ho(ju>i), Ho(ju 2 ), 

■ ■ -,Ho(jVn)]. 
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(c) \Hi(jfMjj,juj)\ < c for some constant c independent of p and for all 
oj G 1Z. 

Then there exists a p* > 0 such that for p G [0, p*), <f> is PE of order n with 
level of excitation a \ > O(p) provided that the input signal u is dominantly 
rich of order n for the plant (9.2.16). 

Proof of Lemma 9.2.1: Let us define 

fo = H 0 (s)u, pi = H 1 (ps,s)u 

Because po is the signal vector for the ideal case, i.e., H 0 (s) does not depend on p, 
u being sufficiently rich of order n together with the assumed properties (a), (b) of 
H 0 (s) imply, according to Theorem 5.2.1, that po is PE with level «o > 0 and ao 
is independent of p, i.e., 


t+T 

Po(t)P o {r)dr > a 0 I (9.2.20) 

Vf > 0 and some T > 0. On the other hand, because Hi(ps,s) is stable and 
\Hi(jpu>,ja>)\ < c for all w € 1Z, we have <j> i € Coo and 



1 

T 


rt+T 


( r )dr < pi 


(9.2.21) 


for some constant P which is independent of p. Note that 


<t>( T )<t> T ( T )dr = - J (Po(t) + pfPr))^ (t) + pfj (r))d 

t+T M T )<t>o(T) 


1 

> — 

- j, 


r t+T 


-dr — p 


( T )dr 


where the second inequality is obtained by using (x + y){x + y) T 
obtain that 


1 

T 


rt+T 




<t>{T)<t> (r)dr > —I - p pi 


\ XX 

— 2 


yy 


we 


which implies that p has a level of PE aq 


9 f-, say, for p G [0, p*) where p* 


A 



Lemma 9.2.1 indicates that if u is dominantly rich, then the PE level of 
po, the signal vector associated with the dominant part of the plant, is much 
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higher than that of fief) i, the signal vector that is due to the unmodeled part 
of the plant, provided of course that fi is relatively small. The smaller the 
parameter /z is, the bigger the separation of the spectrums of the dominant 
and unmodeled high frequency dynamics. 

Let us now use Lemma 9.2.1 to prove Theorem 9.2.1. 

Proof of Theorem 9.2.1 The error equation that describes the stability properties 
of the parameter identifier is given by 

9 = -T^d + nT<jrq (9.2.22) 


where 9 = 9—9* is the parameter error. Let us first assume that all the conditions of 
Lemma 9.2.1 are satisfied so that for a dominantly rich input and for all /i £ [0, /z*), 
the signal vector <fr is PE with level a\ > 0(/z). Using the results of Chapter 4 we 
can show that the homogeneous part of (9.2.22) is u.a.s., i.e. , there exists constants 
a > 0,/3 > 0 independent of /z such that the transition matrix 4>(f, to) of the 
homogeneous part of (9.2.22) satisfies 

||4>(f,fo)|| < /3e _Q(t_to) (9.2.23) 

Therefore, it follows from (9.2.22), (9.2.23) that 

\9(t)\ < ce~ at + fie, V^z € [0, n*) 

where c > 0 is a finite constant independent of /z, which implies that 9,e £ Coo and 
9(t) converges to the residual set TZ e exponentially fast. 

Let us now verify that all the conditions of Lemma 9.2.1 assumed above are 
satisfied. These conditions are 

(a) Ho(juJi ), . . . , Ho(jcun) are linearly independent for all possible u>i,u> 2 , . . . ,u% 

where u>i ^ Uk, i, k = 1, . . . , n; n = n + m + 1 

(b) For any set {u>\, 0 J 2 , ■ ■ ■ , where \u>i — u>k\ > 0(/z),z k and |o>j| < 0(1), 

we have 

\det{[Ho(jui), H 0 (ju> 2 ), ■ ■ ■ , H 0 (ju n )]}\ > 0(/z) 

(c) \Hi(/j,ju,juj)\ < c for all u> € TZ 

It has been shown in the proof of Theorem 5.2.4 that the coprimeness of the 
numerator and denominator polynomials of Gq(s) implies the linear independence 
of H 0 (jcJi),H 0 (ju! 2 ), • • • , H 0 (juJn) for any Ui,u 2 , . . . , with ^ uj k and thus (a) 
is verified. From the definition of F/i(/zs, s) and the assumption of Go(s)A m (/zs, s) 
being proper, we have \Hi(njco, ju) \ < c for some constant c, which verifies (c). 
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To establish condition (b), we proceed as follows: From the definition of Hq(s), 
we can write 


Hq(s) — Qo 


A (s)R p (s) 


where Q o € lZ n xn is a constant matrix. Furthermore, Q o is nonsingular, otherwise, 
we can conclude that H 0 (juji), H 0 (ju> 2 ), ■ ■ ■ , H 0 (ju)n) are linearly dependent that 
contradicts with (a) which we have already proven to be true. Therefore, 


'(M )"” 1 (M ) fi_1 
(M)"“ 2 (jcJ2) n ~ 2 


= Q o 


jw 1 ju 2 

1 1 


(J us) 8 " 1 ' 
(MO *- 2 


diag 


ju n 

1 


1 


A {ju>i)Rp(juJi) 


(9.2.24) 


Noting that the middle factor matrix on the right hand side of (9.2.24) is a Van- 
dermonde matrix [62], we have 


det {[Ho(ju !), . . . , i?o(M)]} = det(Qo) 

i=l 


1 


n - m) 

l<i<fc<n 


and, therefore, (b) follows immediately from the assumption that oj,; — Wfc| > O(fT). 

□ 


Theorem 9.2.1 indicates that if the input u is dominantly rich of order 
n + m + 1 and there is sufficient separation between the spectrums of the 
dominant and unmodeled high frequency dynamics, i.e., fj, is small, then the 
parameter error bound at steady state is small. The condition of dominant 
richness is also necessary for the parameter error to converge exponentially 
fast to a small residual set in the sense that if u is sufficiently rich but not 
dominantly rich then we can find an example for which the signal vector (j) 
loses its PE property no matter how fast the unmodeled dynamics are. In 
the case of the pure least-squares algorithm where the matrix T in (9.2.22) 
is replaced with P generated from P = —P^^-P, we cannot establish the 
u.a.s. of the homogeneous part of (9.2.22) even when <fr is PE. As a result, 
we are unable to establish (9.2.23) and therefore the convergence of 8 to a 
residual set even when u is dominantly rich. 
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9.2.3 Robust Adaptive Observers 

In this section we examine the stability properties of the adaptive observers 
developed in Chapter 5 for the plant model 

y = G 0 (s)u (9.2.25) 

when applied to the plant 

y = G 0 {s)(l + ixA m (ys,s))u (9.2.26) 

with multiplicative perturbations. As in Section 9.2.2, the overall plant 
transfer function in (9.2.26) is assumed to be strictly proper with stable 
poles and linr M ^o yGo(s)A m (ns, s) = 0 for any given s. A minimal state- 
space representation of the dominant part of the plant associated with Gq(s) 
is given by 

In— 1 

x a = —a p x a + b p u, x a £ 1Z n 

0 

V = [1,0, ...,0 ]x Q + nr) 

V = G 0 (s)A m (ys,s)u (9.2.27) 

Because the plant is stable and u £ -Coo, the effect of the plant perturbation 
appears as an output bounded disturbance. 

Let us consider the adaptive Luenberger observer designed and analyzed 
in Section 5.3, i.e. , 


x = A(t)x + bp(t)u + K(t)(y — y), x € lZ n 
V = [1,0,..., 0]s (9.2.28) 

where x is the estimate of x a , 

In— 1 

A(t ) = —a p , K(t) = a* — a p 

0 

a p . bp are the estimates of a p ,b p , respectively, and a* £ 7 Z n is a constant 
vector, and such that 

In— 1 

A* = -a* 


0 
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is a stable matrix. It follows that the observation error x = x a — x satisfies 
x = A* x — b p u + a p y + g(ci p — a*)r] (9.2.29) 

where A* is a stable matrix; a p = a p — a p , b p = b p — b p are the parameter 
errors. The parameter vectors a p ,b p contain the coefficients of the denomi- 
nator and numerator of G'o(s) respectively and can be estimated using the 
adaptive law described by equation (9.2.18) presented in Section 9.2.2 or 
any other adaptive law from Tables 4.2, 4.3. The stability properties of 
the adaptive Luenberger observer described by (9.2.28) and (9.2.18) or any 
adaptive law from Tables 4.2 and 4.3 based on the parametric model (9.2.17) 
are given by the following theorem: 


Theorem 9.2.2 Assume that the input u is dominantly rich of order 2 n 
for the plant (9.2.26) and Gq(s ) has no zero-pole cancellation. The adap- 
tive Luenberger observer consisting of equation (9.2.28) with (9.2.18) or any 
adaptive law from Tables f.2 and f.3 (with the exception of the pure least- 
squares algorithm) based on the parametric model (9.2.17) with /i = 0 applied 
to the plant (9.2.26) with /i / 0 has the following properties: There exists a 
g* > 0 such that for all /i £ [0, g*) 


(i) 

(ii) 


All signals are u.b. 


The state observation error x and parameter error 6 converge exponen- 
tially fast to the residual set 


1Z P = 


| x, 0 \x\ + \9\ < c/j j 


where c > 0 is a constant independent of fi. 


Proof: The proof follows directly from the results of Section 9.2.2 where we have 
established that a dominantly rich input guarantees signal boundedness and expo- 
nential convergence of the parameter estimates to a residual set where the parameter 
error 9 satisfies \6\ < eg provided g £ [0,/r*) for some g* >0. Because |6 p |,|o p | 
converge exponentially to residual sets whose size is of O(g) and it, y , g are bounded 
for any g £ [0, /u*), the convergence of x, 9 to the set R e follows directly from the 
stability of the matrix A* and the error equation (9.2.29). □ 
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Theorem 9.2.2 shows that for stable plants, the combination of a state 
observer with an adaptive law from Tables 4.2 and 4.3 developed for modeling 
error and disturbance free parametric models leads to a robust adaptive 
observer provided that the input signal u is restricted to be dominantly 
rich. If u is not dominantly rich, then the results of Theorem 9.2.2 are no 
longer valid. If, however, instead of the adaptive laws of Tables 4.2 and 4.3, 
we use robust adaptive laws from Tables 8.1 to 8.4, we can then establish 
signal boundedness even when u is not dominantly rich. Dominant richness, 
however, is still needed to establish the convergence of the parameter and 
state observation errors to residual sets whose size is of the order of the 
modeling error. As in the case of the identifiers, we are unable to establish 
the convergence results of Theorem 9.2.1 for the pure least-squares algorithm. 

9.3 Robust MRAC 

In Chapter 6, we designed and analyzed a wide class of MRAC schemes 
for a plant that is assumed to be finite dimensional, LTI, whose input and 
output could be measured exactly and whose transfer function Go(s) satisfies 
assumptions PI to P4 given in Section 6.3.1. We have shown that under these 
ideal assumptions, an adaptive controller can be designed to guarantee signal 
boundedness and convergence of the tracking error to zero. 

As we explained in Chapter 8, in practice no plant can be modeled ex- 
actly by an LTI finite-dimensional system. Furthermore, the measurement 
of signals such as the plant input and output are usually contaminated with 
noise. Consequently, it is important from the practical viewpoint to exam- 
ine whether the MRAC schemes of Chapter 6 can perform well in a realistic 
environment where Gq(s) is not the actual transfer function of the plant 
but instead an approximation of the overall plant transfer function, and the 
plant input and output are affected by unknown disturbances. In the pres- 
ence of modeling errors, exact model-plant transfer function matching is no 
longer possible in general and therefore the control objective of zero tracking 
error at steady state for any reference input signal may not be achievable. 
The best one can hope for, in the nonideal situation in general, is signal 
boundedness and small tracking errors that are of the order of the modeling 
error at steady state. 

In this section we consider MRAC schemes that are designed for a sirnpli- 
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fied model of the plant but are applied to a higher-order plant. For analysis 
purposes, the neglected perturbations in the plant model can be character- 
ized as unstructured uncertainties of the type considered in Chapter 8. In 
our analysis we concentrate on multiplicative type of perturbations. The 
same analysis can be carried out for additive and stable factor perturbations 
and is left as an exercise for the reader. 

We assume that the plant is of the form 

Up = G'o(s)(l + A m (s))(up + d u ) (9.3.1) 

where G'o(s) is the modeled part of the plant, A m (s) is an unknown mul- 
tiplicative perturbation with stable poles and d u is a bounded input dis- 
turbance. We assume that the overall plant transfer function and Go are 
strictly proper. This implies that GoA m is also strictly proper. We design 
the MRAC scheme assuming that the plant model is of the form 

y P = G 0 (s)u p , Gq(s) = y (9.3.2) 

where Go(s) satisfies assumptions PI to P4 in Section 6.3.1, but we imple- 
ment it on the plant (9.3.1). The effect of the dynamic uncertainty A m (s) 
and disturbance d u on the stability and performance of the MRAC scheme 
is analyzed in the next sections. 

We first treat the case where the parameters of Go(s) are known exactly. 
In this case no adaptation is needed and therefore the overall closed-loop 
plant can be studied using linear system theory. 

9.3.1 MRC: Known Plant Parameters 

Let us consider the MRC law 

u p = Up + ®zVp + c o r (9.3.3) 

developed in Section 6.3.2 and shown to meet the MRC objective for the 
plant model (9.3.2). Let us now apply (9.3.3) to the actual plant (9.3.1) and 
analyze its properties with respect to the multiplicative uncertainty A m (s) 
and input disturbance d u . 
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Plant 



Figure 9.1 Closed-loop MRC scheme. 


Following the analysis of Section 6.3.2 (see Figure 6.4), the closed-loop 
plant is represented as shown in Figure 9.1 where 


C(s) 


Ms)c*o 

A (s) — ^ T a(s) ’ 


F(s) = 


9% T a(s) + ^3 A(.s) 

cSA(s) 


The closed-loop plant is in the form of the general feedback system in 
Figure 3.2 analyzed in Chapter 3. Therefore using equation (3.6.1) we have 




C -FCG ' 



Up 


1 + FCG 1 + FCG 


r 

. y p . 


CG G 


d u 



[ 1 + FCG 1 + FCG \ 



(9.3.4) 


where G = Go(l + A m ) is the overall transfer function which is assumed 
to be strictly proper. The stability properties of (9.3.4) are given by the 
following Theorem: 


Theorem 9.3.1 Assume that 9* = [#| T , 9^, 9%, Cq] t is chosen so that for 
A m (s) = 0 ,d u = 0, the closed-loop plant is stable and the matching equation 

CGp _ 

1 + FCGo 

where W m (s) = k m is the transfer function of the reference model, is 
satisfied, i.e., the MRC law (9.3.3) meets the MRC objective for the simplified 
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plant model (9.3.2). If 


9fa(s) + 9* 3 A(s) k p 


A W 


Wm(®) ^m(®) 


< 1 


(9.3.5) 


then the closed-loop plant is internally stable. Furthermore there exists a 
constant 5* > 0 such that for any 6 £ [0, <5*] the tracking error e± = y v — y m 
satisfies 

lim sup |ei(r)| < Aro + cdo (9.3.6) 

°° T>t 

where ro, do are upper bounds for |r(t)|, \d u (t)\, c> 0 is a finite constant 


A = 


W m {s)(A{s) - Cf(s))R p (s) 


Z p (s)[k m A(s ) - k p W m (s)Dl(s)A m (s)] 


W m (s)A m (s) 


2 S 


with C((s) = 0J T a(s), £)J(s) = 9^ a{s) + 9 3 A(s) and y m = W m (s)r is the 
output of the reference model. 


Proof A necessary and sufficient condition for r, d u £ to imply that y p , u p £ C ^ 
is that the poles of each element of the transfer matrix in (9.3.4) are in C~ . It can 
be shown using the expressions of F, C and G that the characteristic equation of 
each element of the transfer matrix in (9.3.4) is given by 

(A - Cl)R p - D\k p Z p (\ + A m ) = 0 (9.3.7) 

Using the matching equation (6.3.13), i.e. , (A — C()R P — k p DlZ p = Z p A 0 R m , the 
characteristic equation (9.3.7) becomes 


Z p (A 0 R m - k p D\ A m ) = 0 

where Ao is a factor of A = Ao Z m . Because Z p is Hurwitz, we examine the roots of 

Ao Rm — k p DiA m = 0 

which can be also written as 

1 D\k p A k p DlW m A m _ n 

A 0 Rm m ~ k m A 

Because the poles of are i n C~ , it follows from the Nyquist criterion that 

for all A m satisfying 

k p D\ (a) W m (s) A m (s) 
k m A(s) 


OO 


(9.3.8) 
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the roots of (9.3.7) are in C~ . Hence, (9.3.8), (9.3.4) imply that for r,d u € Coo, we 
have u p , y p e Coo . 

Because F(s ) = — ^^(s) ^ ias stable poles and r,d u ,u p ,y p G Coo, it follows from 
Figure 9.1 that all signals in the closed-loop scheme are bounded. 

The tracking error ei = y p — y m is given by 

e = W m A m G 0 (l + A m ) (9 3 9) 

1 + FCG 0 (1 + A m ) + 1 + FGG 0 (1 + A m ) “ 

Substituting for F, C, Go and using the matching equation, we have 


ei 


C i) R p w A 

Z p (k m A-k p W m D*A m ) m m 


k p {K-Ct)W m {l + A m ) d 
(k m A - kpWmDIAm) 


(9.3.10) 


Due to G(s),Go(s) being strictly proper and the fact that W m (s) has the same 
relative degree as Gq(.s), it follows that W / m (s)A m (s) is strictly proper, which implies 
that the transfer function e\ (s)/r(s) is strictly proper and ei(s)/d u (s) is proper. 
Furthermore, both ei(s) /r(s) , e\(s) / d u (s) have stable poles (due to (9.3.5)), which 
implies that there exists a constant 8 * > 0 such that , TJs) are ana lyhic in 
Re[s] > — Using the properties of the £25 norm, Lemma 3.3.2, (9.3.10) and the 
fact that r,d u € Coo, the bound for the tracking error given by (9.3.6) follows. □ 


Remark 9.3.1 The expression for the tracking error given by (9.3.9) sug- 
gests that by increasing the loop gain FCGq, we may be able to im- 
prove the tracking performance. Because F , C, Gq are constrained by 
the matching equation, any changes in the loop gain must be performed 
under the constraint of the matching equation (6.3.13). One can also 
select A(s) and W m (s) in an effort to minimize the H 0 Q norm in the 
C- 2 e bound for e\, i.e., 

W m A m Go(l + A m ) 

1 + FCG 0 (1 + A m ) J nll2+ 1+FCG 0 (1 + A m ) J 1,2 

under the constraint of the matching equation (6.3.13). Such a con- 
strained minimization problem is beyond the scope of this book. 

Example 9.3.1 Let us consider the plant 


eit\\2 < 


y = — J— (1 + A m {s))u = G(s)u 
s + a 


(9.3.11) 
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where G(s) is the overall strictly proper transfer function of the plant and A m is 
a multiplicative perturbation. The control objective is to choose u such that all 
signals in the closed-loop plant are bounded and y tracks as close as possible the 
output y rn of the reference model 


where a m ,b m > 0. The plant (9.3.11) can be modeled as 

2/=—"— u (9.3.12) 

S H - CL 

The MRC law based on (9.3.12) given by 

u = 0*y + b m r 

where 9* = a — a m meets the control objective for the plant model (9.3.12) exactly. 
Let us now implement the same control law on the actual plant (9.3.11). The 
closed-loop plant is given by 


^ _ b m ( 1 + A m (s)) 

V s + a m - 9* A m (s) r 

whose characteristic equation is 

s + a m - 9*A m (s) = 0 


^*A m (s) 
s + a m 


Because 9 is strictly proper with stable poles, it follows from the Nyquist 

criterion that a sufficient condition for the closed-loop system to be stable is that 
A m (s) satisfies 


0*A m (s) 



S + a m 

oo 

s + a m 


(9.3.13) 


The tracking error e\ = y — y m satisfies 


b m ( s + a m + 9 *) 

^1 7 | 77 | 77T77 7 7rA m (s)r 

(s + n m ) (s -t- Q m 9 A m (s)) 


Because r € Coo and the transfer function ei(s)/r(s) has stable poles for all A m 
satisfying (9.3.13), we have that 


lim sup|ei(r)| < Ar 0 
t-> °o T > t 
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where |r(i)| < ro and 

A = 

Furthermore, 


(s + a m + 0*) 


eit || 2 < 


s T (5 T cL m 0 A m (s)) 


A m (s) 


28 


(s + a m )(l- 


Ft 2 


(s+a m ) 

Therefore, the smaller the term || ^7^ lloo is, the better the stability margin and 
tracking performance will be. 

Let us consider the case where 


A m (s) 


2/rs 

1 + US 


and fi > 0 is small, that arises from the parameterization of the nonminimum phase 
plant 

2 /is 


1 — /xs 1 

^ (s + a)(l + /j,s) s + a 
For this A m (s), condition (9.3.13) becomes 


1 - 


1 + 


(9.3.14) 


(a — a m ) 2 /is 
(s + a m ) (1 + /xs) 


< 1 


which is satisfied provided 


H < 


1 


2 1 d dm | 

Similarly, it can be shown that A = 0(/x), i.e., the faster the unmodeled pole and 
zero in (9.3.14) are, the better the tracking performance. As /i — > 0, A — > 0 and 
therefore lim^^ sup r >t l e i( r )l 0- V 


9.3.2 Direct MRAC with Unnormalized Adaptive Laws 

The adaptive control schemes of Chapter 6 designed for the simplified plant 
model (9.3.2) can no longer be guaranteed to be stable when applied to the 
actual plant (9.3.1) with A m (s) / 0 or d u / 0. We have already demon- 
strated various types of instability that may arise due to the presence of 
modeling errors using simple examples in Section 8.3. The main cause of in- 
stability is the adaptive law that makes the overall closed-loop plant nonlin- 
ear and more susceptible to modeling error effects. We have already proposed 
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various types of modifications for changing the adaptive laws to counteract 
instabilities and improve performance in Chapter 8. In this section, we use 
some of these modification techniques to improve the robustness properties 
of the direct MRAC schemes with unnormalized adaptive laws developed in 
Section 6.4. 

We start by considering the same plant as in Example 9.3.1, i.e., 

V = — t — ( 1 + A m (s))n (9.3.15) 

s + a 

where a is unknown and u is to be chosen so that the closed-loop plant is 
stable and y tracks the output y m of the reference model 

bm 


as close as possible. The control law based on the modeled part of the plant 
obtained by neglecting A m (s) is given by 

u = 9y + b m r (9.3.16) 

where 9 is to be generated by an adaptive law. Letting 9 = 9 — 9* where 
9* = a — a m is the unknown controller parameter (see Example 9.3.1), we 
write the control law as 


u = 9*y + b m r + 9y 

and use it in (9.3.15) to obtain the closed-loop plant 

y = W(s)(9y + b m r) 


where 


W{s) 


1 + A m (s) 

s + a — 9* — 9*A. m (s) 


1 + A m {s) 
s + a m — 9*A m (s) 


The condition for W(s) to have stable poles is the same as in the non- 
adaptive case (i.e., 0 = 0) and is given by 


9*A m (s) 

S T dm 


oo 


(9.3.17) 
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Defining the tracking error e± = y — y m , we obtain the error equation 

' b m r 


or 


ei = W(s)6y + ( W{s) - 
ei = W (s)6y + 


s + dj] 
A m (s)(s + a) 


-bm,r 


(s + a m )(s + a m - 0*A m (s)) 

The ideal error equation obtained by setting A m (s) = 0 is 

1 


ei = 


-6y 


(9.3.18) 


s + a m 

which, based on the results of Chapter 4, suggests the adaptive law 


6 = 6 = —jei y 


(9.3.19) 


due to the SPR property of ' . As we showed in Chapter 4, the adaptive 

control law (9.3.16), (9.3.19) meets the control objective for the plant (9.3.15) 
provided A m (s) = 0 . 

The presence of A m (s) introduces an external unknown input that de- 
pends on A m ,r and acts as a bounded disturbance in the error equation. 
Furthermore A m (s) changes the SPR transfer function that relates e± to 6y 
from 5+ x a to W(s) as shown by (9.3.18). As we showed in Chapter 8 , adap- 
tive laws such as (9.3.19) may lead to instability when applied to (9.3.15) due 
to the presence of the disturbance term that appears in the error equation. 
One way to counteract the effect of the disturbance introduced by A m (s) / 0 
and r / 0 is to modify (9.3.19) using leakage, dead zone, projection etc. as 
shown in Chapter 8 . For this example, let us modify (9.3.19) using the fixed 
a-modification, i.e., 

0 = —7 e\y — 7 ad (9.3.20) 

where a > 0 is small. 

Even with this modification, however, we will have difficulty establishing 
global signal boundedness unless W(s) is SPR. Because W(s) depends on 
A m (s) which is unknown and belongs to the class defined by (9.3.17), the 
SPR property of W(s) cannot be guaranteed unless A m (s) = 0. Below we 
treat the following cases that have been considered in the literature of robust 
adaptive control. 
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Case I: W(s) Is SPR— Global Boundedness 

Assume that A m (s) is such that W(s) is guaranteed to be SPR. The error 
equation (9.3.18) may be expressed as 


ei = W(s)(9y + d) 


(9.3.21) 


where d = (l - 't+alf ) b m r is guaranteed to be bounded due to r £ Coo 
and the stability of IT _1 (s) which is implied by the SPR property of W(s). 
The state-space representation of (9.3.21) given by 


e = A c e + B c (9y + d) 
e\ = Cj e 


(9.3.22) 


where Cj (si — A c ) 1 B C = W(s) motivates the Lyapunov-like function 

e T P c e 6 2 

V = — H 

2 2 7 

with P c = Pj > 0 satisfies the equations in the LKY Lemma. The time 
derivative of V along the solution of (9.3.20) and (9.3.22) is given by 

V = —e T qq T e — fee 1 L c e + e\d — 066 


where v c > 0, L c = Lj > 0 and q are defined by the LKY Lemma. We have 


V < —v c \ c \e\ 2 + c|e||d| - a\9\ 2 + a \d\\0* 


where c = |j C T |j , A c = A m in(L c 
subtracting aV, we obtain 


Completing the squares and adding and 


2jV c 2j 


(9.3.23) 


where a = min{ 1 j^,a'y} and X p = A m ax(Pc), which implies that V and, 
therefore, e, 6 , e\ £ Coo and that e\ converges to a residual set whose size 
is of the order of the disturbance term |d| and the design parameter a. If, 
instead of the fixed-u, we use the switching a or the projection, we can verify 
that as A m (s) — > 0, i.e. , d — > 0, the tracking error e\ reduces to zero too. 

Considerable efforts have been made in the literature of robust adaptive 
control to establish that the unmodified adaptive law (9.3.19) can be used to 



660 


CHAPTER 9. ROBUST ADAPTIVE CONTROL SCHEMES 


establish stability in the case where y is PE [85, 86]. It has been shown [170] 
that if VE(s) is SPR and y is PE with level ao > 70^0 + 71 where uq is an 
upper bound for \d(t)\ and 70, 71 are some positive constants, then all signals 
in the closed-loop system (9.3.18), (9.3.19) are bounded. Because 00,^0 are 
proportional to the amplitude of r, the only way to generate the high level 
ao of PE relative to uq is through the proper selection of the spectrum 
of r. If A m (s) is due to fast unmodeled dynamics, then the richness of r 
should be achieved in the low frequency range, i.e., r should be dominantly 
rich. Intuitively the spectrum of r should be chosen so that \W (jto) | 3> 
\W(jlo) — | for all lo in the frequency range of interest. 

An example of A m (s) that guarantees W(s) to be SPR is A m (s) = 
where y > 0 is small enough to guarantee that A m (s) satisfies (9.3.17). 


Case II: VE(s) Is Not SPR— Semiglobal Stability 


When W(s) is not SPR, the error equation (9.3.18) may not be the appro- 
priate one for analysis. Instead, we express the closed-loop plant (9.3.15), 
(9.3.16) as 

V = — f— ( 0*y + b m r + Oy ) H ]— A m (s)u 

s + a s + a 

and obtain the error equation 


ei = — (6y + A m (s)u) 

s + a m 

or 

ei = -a m e 1 + 6y + r) 

r) = A m (s)u = A m (s)(9y + b m r ) 

Let us analyze (9.3.24) with the modified adaptive law (9.3.20) 


(9.3.24) 


6 = — yei y — 7 a6 


The input y to the error equation (9.3.24) cannot be guaranteed to be 
bounded unless u is bounded. But because u is one of the signals whose 
boundedness is to be established, the equation y = A m (s)u has to be ana- 
lyzed together with the error equation and adaptive law. For this reason, we 
need to express y = A m (s)u in the state space form. This requires to assume 
some structural information about A m (s). In general A m (s) is assumed to 
be proper and small in some sense. A m (s) may be small at all frequencies, 
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i.e., A m (s) = fj,Ai(s) where n > 0 is a small constant whose upper bound 
is to be established and Ai(s) is a proper stable transfer function. A m (s) 
could also be small at low frequencies and large at high frequencies. This is 
the class of A m (s) often encountered in applications where A m (s) contains 
all the fast unmodeled dynamics which are usually outside the frequency 
range of interest. A typical example is 

A m (s) = (9.3.25) 

which makes (9.3.15) a second-order nonminimum-phase plant and W(s) in 
(9.3.18) nonminimum-phase and, therefore, non-SPR. To analyze the stabil- 
ity of the closed-loop plant with A m (s) specified in (9.3.25), we express the 
error equation (9.3.24) and the adaptive law (9.3.20) in the state-space form 

ei = — a m ei + By + rj 

M r) = -rj - 2 fiu (9.3.26) 

9 = — yei y — 7 ad 

Note that in this representation n = 0 =$■ r) = 0, i.e., the state rj is a 
good measure of the effect of the unmodeled dynamics whose size is charac- 
terized by the value of /j. The stability properties of (9.3.26) are analyzed 
by considering the following positive definite function: 

V(ei,ri,e) = ^ + ^- + |(r/ + ei ) 2 (9.3.27) 

For each fi > 0 and some constants cq > 0, a > 0, the inequality 

V(ei ,rj,9) < co^ 2a 

defines a closed sphere £(//, a, co) in 7Z 3 space. The time derivative of V 
along the trajectories of (9.3.26) is 

= —a m ef - <j 98 - rj 2 + /i[ei - 2 u] (rj + e\ ) 

„ a'2 

< -a m e 1 rj 2 -| f nc[e\+\e\\ 3 +e\+e\\9\ + e\0 2 + \ei\6 2 

+ \e\\\6\ + |ei||( 9 ||? 7 | + |ei| 3 |r/| + |ei| + \r]\ + if + \ef\ri\ 

+ \v\\^\ + \v\9 2 + ef|ry| + |ei0 2 ||?y| + \9\if + |ei||r| + |?y||r|] (9.3.28) 


V 
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for some constant c > 0, where the last inequality is obtained by substituting 
for ei , u = Oy + 0y + b m f and taking bounds. Using the inequality 2 a/3 < 
a 2 + /3 2 , the multiplicative terms in (9.3.28) can be manipulated so that after 
some tedious calculations (9.3.28) is rewritten as 


V < - 


~ e? 


a m e 2 ad 2 rj 2 


V 


\ - R c {\0\ + 1) 


y - ^c(e? + |ei| + \6\ + 0 2 + \rj\ + |ei||? 7 | + 1) 


- 0 2 


a 


- - /zc( |ei| + 1 + \ei 


i . i2 u \ 0*\ 2 , . 

+ /rc|r| + nc H — — (9.3.29) 


Inside £(//, a, Co), |ei|,|0| can grow up to 0(/i _Q ) and |?/| can grow up to 
0(/i” 1//2 ~ a ). Hence, there exist positive constants ki, k 2 ,k^ such that inside 
a, Co), we have 

|ei| < kifi~~ a , \6\ < k 2 lT a , 1 7/| < k 3 fi~ 1/2 ~ a 
For all ei,r),0 inside C(n,a,co) , (9.3.29) can be simplified to 


T r a m 2 T G 7\2 2 o 


1— a 


_ 2 / (hn _ 1/2— 2a 

ll 2 


2 

“ ““A ) - 02 ( t - Al/J 


1— a 


(9.3.30) 


+/rc|r| + /uc + 


(T0 : 


1*2 


for some positive constants fti, fl 2 , Ps- If we now fix a > 0 then for 0 < a < 
1/4, there exists a fi* > 0 such that for each /j, e (0, /i*] 

y > /i 1/2 - 2Q /3!, ^ > /^3M 1_a 

Hence, for each /i e (0, //] and ei, ?/, 9 inside £(/r, a, co), we have 

U<-— ei-y--0 + — + Hn +^ c 
On the other hand, we can see from the definition of V that 


(9.3.31) 
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where C 4 = max{ 1 + 2>1 , ^,2^}. Thus, for any 0 < /3 < I/C 4 , we have 

q-Q*2 

V < -/3V H h iic\r\ 2 + /ic 

Because r, r are uniformly bounded, we define the set 


A 


Do(li) = < ei, 6», 77 


V(e u ri,e) < 


cr9* 2 


+ fic\r\ 2 + 11 c 


which for fixed a and a and for sufficiently small 11 is inside £(/i, a, co). Out- 
side Do(fi) and inside £(n, a, co), V <0 and, therefore, V (ei, 77, 9) decreases. 
Hence there exist positive constants ci,C2 such that the set 


D(v) = {ei, 77, 9 |ei| + \9\ < a fi “, \r}\ < c 2 ii a 1/2 } 


is inside £(/i,a,co ) and any solution ei (t),rj(t),6(t) which starts in D(fi) 
remains inside £(/i, a, co). Furthermore, every solution of (9.3.26) that starts 
from D(n)\Do(/i ) will enter Do(/i ) at t = ti for some finite time t\ and 
remain in Dq(ii) thereafter. Similarly, any solution starting at t = 0 from 
D()(n) will remain in Do(fi) for all t > 0. 

The stability results obtained in this example are semiglobal in the sense 
that as 11 — > 0 the size of D(/i) becomes the whole space. 

The above analysis demonstrates that the unnornralized adaptive law 
with (7-modification guarantees a semiglobal boundedness result in the pres- 
ence of fast unmodeled dynamics. The speed of unmodeled dynamics is 
characterized by the positive scalar fi > 0 in such a way that, as n — ► 0, the 
unmodeled dynamics become infinitely fast and reduce to their steady-state 
value almost instantaneously. The region of attraction from which every so- 
lution is bounded and converges to a small residual set becomes the whole 
space as fi — > 0. 

A similar stability result is established for the general higher order case 
in [85, 86] using the fixed u-modification. Other modifications such as dead 
zone, projection and various other choices of leakage can be used to estab- 
lish a similar semiglobal boundedness result in the presence of unmodeled 
dynamics. 



664 


CHAPTER 9. ROBUST ADAPTIVE CONTROL SCHEMES 


Case III: W(s) Is Not SPR— Method of Averaging: Local Stability 

The error system (9.3.18), (9.3.19) is a special case of the more general error 
system 

ei = W(s)6 t lu + d (9.3.32) 

9 = -7ei u (9.3.33) 

that includes the case of higher order plants. In (9.3.32), W(s) is strictly 
proper and stable and d is a bounded disturbance. The signal vector oj £ IZ 2n 
is a vector with the filtered values of the plant input and output and the 
reference input r. For the example considered above, to = y. In the error 
system (9.3.32), (9.3.33), the adaptive law is not modified. 

With the method of averaging, we assume that the adaptive gain 7 > 0 is 
sufficiently small (slow adaptation) so that for ei,a; bounded, 9 varies slowly 
with time. This allows us to approximate (9.3.32) with 

ei ~ 9 t W(s)u) + d 

which we use in (9.3.33) to obtain 

9 = —'yLo(W(s)u}) T 9 — 'ycud (9.3.34) 

Let us consider the homogeneous part of (9.3.34) 

6 = -7 R(t)9 (9.3.35) 

where R(t) = cu(t)(IF(s)a;(t)) T . Even though R(t ) may vary rapidly with 
time, the variations of 9 are slow due to small 7 . This fact allows us to 
obtain stability conditions for (9.3.35) in terms of sample averages 

Ri{Ti) = f R(t)dt , U = L-i + Ti 
Jti-i 

over finite sample intervals 0 <T % < 00 . We consider the case where R{t) = 
R(t + T) is bounded and periodic with period T > 0. By taking X) = X, the 
value of R = Ri(T ) is independent of the interval. It has been established 
in [3] that there exists a constant 7 * > 0 such that (9.3.35) is exponentially 
stable for 7 e ( 0 , 7 *) if and only if 


minReAj(Ii) > 0, i = 1, 2, . . . , 2n 


(9.3.36) 
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where R = ^ R(t)dt. Condition (9.3.36) is equivalent to the existence of 
a matrix P = P T > 0 such that 


PR + R 1 P > 0 

When the above inequality is satisfied for P = I, namely, 

R+R t > 0 (9.3.37) 

its meaning can be explained in terms of the frequency content of ui by letting 

00 2iri 

w= E Qi^ Uit f “i = T 

i=— oo 

where Q-i is the conjugate of Qi. Because 

OO 

W(s)u= Y QiW(M)e juJit 

i— — oo 

it follows that 

1 pT 00 

R=f Lu(t)(W(sMt)) T dt = Y WijuJQiQli (9.3.38) 

0 i=— oo 

For R to be nonsingular, it is necessary that 

oo 

E QiQ-i > 0 

i=— oo 

which can be shown to imply that uj must be PE. Applying the condition 
(9.3.37) to (9.3.38) we obtain a signal dependent “average SPR” condition 

OO 

E Re{WCM)}Re{Q;QT } > 0 (9.3.39) 

i=— oo 

Clearly, (9.3.39) is satisfied if co is PE and W(s) is SPR. Condition (9.3.39) 
allows R eW(jiVi) < 0 for some oji provided that in the sum of (9.3.39), the 
terms with R eW(jui) > 0 dominate. 

The exponential stability of (9.3.35) implies that 9 in (9.3.34) is bounded 
and converges exponentially to a residual set whose size is proportional to 
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the bounds for lo and d. Because lo consists of the filtered values of the plant 
input and output, it cannot be shown to be bounded or PE unless certain 
assumptions are made about the stability of the plant, the initial conditions 
ei(O),0(O) and disturbance d. Let co m represent lo when 6 = 0, ei = 0. 
The signal u ; m , often referred to as the “ tuned solution ,” can be made to 
satisfy (9.3.39) by the proper choice of reference signal r. If we express 
lo = LOm + e where e is the state of a state space representation of (9.3.32) 
with the same order as lo and e\ = C T e, then lo is bounded, PE and satisfies 
(9.3.39) provided e is sufficiently small. This can be achieved by assuming 
that the initial conditions e(O),0(O) and disturbance d are sufficiently small 
and the plant is stable. Therefore, the above stability result is local. Even 
though the above results are based on several restrictive assumptions, they 
are very valuable in understanding the stability and instability mechanisms 
of adaptive schemes in the presence of modeling errors. For further details on 
averaging and the extension of the above results to a wider class of adaptive 
schemes, the reader is referred to [3]. 

9.3.3 Direct MRAC with Normalized Adaptive Laws 

The MRAC schemes with normalized adaptive laws discussed in Section 6.5 
have the same robustness problems as those with the unnornralized adap- 
tive laws, i.e., they can no longer guarantee boundedness of signals in the 
presence of unmodeled dynamics and bounded disturbances. For MRAC 
with normalized adaptive laws, however, the robustification can be achieved 
by simply using the certainty equivalence principle to combine the MRC law 
with any one of the robust adaptive laws presented in Section 8.5. The design 
procedure is the same as that in the ideal case, that is, we use the same con- 
trol law as in the known parameter case but replace the unknown controller 
parameters with their on-line estimates generated by a robust adaptive law. 
The stability analysis of the resulting closed-loop adaptive scheme is also 
similar to the ideal case, with a few minor changes that incorporate the 
small-in-the-mean, instead of the C 2 , properties of 9 and e into the analysis. 

In this section, we first use an example to illustrate the design and stabil- 
ity analysis of a robust MRAC scheme with a normalized adaptive law. We 
then extend the results to a general SISO plant with unmodeled dynamics 
and bounded disturbances. 
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Example 9.3.2 Consider the following SISO plant 

V = (1 + A m (s))u (9.3.40) 

s — a 

with a strictly proper transfer function, where a is unknown and A m (s) is a multi- 
plicative plant uncertainty. Let us consider the following adaptive control law 

u = - By (9.3.41) 


6 

e 

<t> 


z — z 
m3 ’ 

1 

S + CLm 


7 > 0 

m 2 = 1 + i 


1 


2 = 


z = y 


s + a n 


(9.3.42) 


where — a m is the desired closed loop pole and 9 is the estimate of 9* = a + a m . 

As we have shown in Chapter 8, when (9.3.41), (9.3.42) designed for the plant 
model 

1 

V = u 

s — a 

is applied to the plant (9.3.40) with A m (s) ^ 0, the plant uncertainty A m (s) 
introduces a disturbance term in the adaptive law that may easily cause 9 to drift 
to infinity and certain signals to become unbounded no matter how small A m (s) 
is. The adaptive control law (9.3.41), (9.3.42) is, therefore, not robust with respect 
to the plant uncertainty A m (s). This adaptive control scheme, however, can be 
made robust if we replace the adaptive law (9.3.42) with a robust one developed by 
following the procedure of Chapter 8 as follows: 

We first express the desired controller parameter 9* = a + a m in the form of a 
linear parametric model by rewriting (9.3.40) as 


z = 9*<j> + y 


where z,<f> are as defined in (9.3.42) and 

1 . . . 

V = ; A m{s)U 

s + a m 

is the modeling error term. If we now assume that a bound for the stability margin 
of the poles of A m (s) is known, that is, A m (s) is analytic in i?e[s] > — for some 
known constant <5o > 0, then we can verify that the signal m generated as 

m 2 = l + ?n s , m s = -S 0 m 3 + u 2 + y 2 , m s { 0) = 0, <5 0 < 2o m (9.3.43) 
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guarantees that rj/m, 4>/m G C oo and therefore qualifies to be used as a normalizing 
signal. Hence, we can combine normalization with any modification, such as leak- 
age, dead zone, or projection, to form a robust adaptive law. Let us consider the 
switcliing-cr modification, i.e., 


9 = 'yecf) — a s "f9 
z — 0 (f) 
m 2 


(9.3.44) 


where cr s is as defined in Chapter 8. According to Theorem 8.5.4, the robust 
adaptive law given by (9.3.43), (9.3.44) guarantees e,em,9,9 € £oo and e,em,9 G 
S ( -^-2 'j . Because from the properties of the £25 norm we have 


\v(t)\ < A 2 |K||2,5o> A 2 = 


1 

s + a m 


A m (s) 


2#o 


and m 2 = 1 + 1| 1 1 + 1| 3/t II 2« 0 > it follows that ^ < A 2 . Therefore, e,em,9 € <S(A|). 

We analyze the stability properties of the MRAC scheme described by (9.3.41), 
(9.3.43), and (9.3.44) when applied to the plant (9.3.40) with A m (s) ^ 0 as follows: 

As in the ideal case considered in Section 6.5.1 for the same example but with 
A rn (s) = 0, we start by writing the closed-loop plant equation as 


V = — (~9y + A m (s)u) = 9y + V> 

s + dm S + a m 

Using the Swapping Lemma A.l and noting that an 2 = 

(9.3.45) that 

y = —9(j) 4 — ( H) + ri 

= em 2 + — - — 9<t> (9.3.46) 

8 + d m 

Using the properties of the £ 25 norm ||(-)t|| 2< 5, which for simplicity we denote by 
|| • ||, it follows from (9.3.46) that 

Nl<||em 2 || + ||-^— lUH^II 

S “h Clm 

for any 0 < 6 < Sq. Because u = —9y and 9 G £00 , it follows that 


u = —9y (9.3.45) 

— 9(f> + 77, we obtain from 


ll«ll < c||y|| < c||er7T. 2 || + c||(90|| 


where c > 0 is used to denote any finite constant. Therefore, the fictitious normal- 
izing signal 


m 2 =l + H 2 + \\y \\ 2 < 1 + c||em 2 || 2 + c||^|| 2 


(9.3.47) 
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We can establish as in Chapter 6 that m/ guarantees that to/to/, 0/m/, r]/mf £ Coo 
by using the properties of the C 28 norm, which implies that (9.3.47) can be written 
as 

m 2 < 1 + c||em?Ti/|| 2 + c||0m/|| 2 < 1 + c\\gmf\\ 2 

.A ~ 2 

where g 2 = e 2 ?n 2 + 9 or 


m 2 < 1 + c / e T ' , g 2 {r)m 2 (r)dr 
Jo 

Applying the B-G Lemma III to (9.3.48), we obtain 


m 2 (<) < 4>(t, 0) + <5 f <3>(i, t)cIt 

Jo 


(9.3.48) 


where 

$(i,r) =e- s ^e c ^ s2(s)ds 

Because the robust adaptive law guarantees that em, 9 £ S( A|), we have g £ 5(A|) 
and 

$(t,r) < 


Hence, for 


cAj < 6 


4>(t, r) is bounded from above by a decaying to zero exponential, which implies 
that uif £ Coo ■ Because of the normalizing properties of m/, we have 4>,y,u £ C^ 
and all signals are bounded. The condition cA^ < S implies that the multiplicative 
plant uncertainty A m (s) should satisfy 


A m (s) 
S 4 “ ^ m 


2 

28 0 


6 

< - 


c 


where c can be calculated by keeping track of all the constants. It can be shown 
that the constant c depends on || jA— ||oo <5 and the upper bound for |0(t)|. 

Because 0 < <5 < <5o is arbitrary, we can choose it to be equal to do- The bound 
for sup 4 \0{t ) | can be calculated from the Lyapunov- like function used to analyze 
the adaptive law. Such a bound, however, may be conservative. If, instead of the 
switching <r, we use projection, then the bound for |0(t)| is known a priori and the 
calculation of the constant c is easier. 

The effect of the unmodeled dynamics on the regulation error y is analyzed as 
follows: From (9.3.46) and m, (j) £ C oo, we have 



rt+T 


2 2 j 

to dr 


r t+T ~2 
9 dr, 


Vi > 0 
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for some constant c > 0 and any T > 0. Then using the m.s.s. property of an, 9, 
we have 

1 r t + T r 

T l V 2 dr < cA| + - 

therefore y £ <S(A|), i.e., the regulation error is of the order of the modeling error 
in m.s.s. 

The m.s.s. bound for y 2 does not imply that at steady state, y 2 is of the order of 
the modeling error characterized by A2. A phenomenon known as bursting, where 
y 2 assumes large values over short intervals of time, cannot be excluded by the 
m.s.s. bound. The phenomenon of bursting is discussed in further detail in Section 

9.4. 

The stability and robustness analysis for the MRAC scheme presented above 
is rather straightforward due to the simplicity of the plant. It cannot be directly 
extended to the general case without additional steps. A more elaborate but yet 
more systematic method that extends to the general case is presented below by the 
following steps: 


Stepl. Express the plant input u and output y in terms of the parameter error 
9. We have 


where y u 


y 

u 


1 

S T 


9y + y 


(s - a)y - A m (s)u 


s — a 
s + a m 


9y + y u 


(9.3.49) 


(q + a m ) ^ 
s+a m 


m 


(s)u. Using the C 25 norm for some S £ (0, 2 a m ) we have 


IMI < c||<fy|| + \\y\\, \\u\\<c\\ey\\ + \\ Vu \\ 


and, therefore, the fictitious normalizing signal m / satisfies 

m) = 1 + IMI 2 + IMI 2 < 1 + c(||%|| 2 + IMI 2 + IMI 2 ) (9.3.50) 


Step 2. Use the swapping lemmas and properties of the C 25 norm to upper 
bound ||0y|| with terms that are guaranteed by the adaptive law to have small in 
m.s.s. gains. We use the Swapping Lemma A. 2 given in Appendix A to write the 
identity 


9y = (1 


S + Qq 


)9y + 


a 0 

5 + QIq 


9y 


— — (9y + 6 y) + 
s + ao 


ao 

S + OfQ 


9y 


(9.3.51) 


where ao > 0 is an arbitrary constant. Now from the equation for y in (9.3.49) we 
obtain 


9y=(s + a m ){ 77 - y) 
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therefore, 

\M\ < ~(\\0y\\ + \\m + a 0 c(\\em 2 W + \\§<f>\\ + |M|) (9.3.52) 

«o 

The gain of the first term in the right-hand side of (9.3.52) can be made small by 
choosing large a®. The m.s.s. gain of the second and third terms is guaranteed by 
the adaptive law to be of the order of the modeling error denoted by the bound 
A 2 , i.e., cm, 9 G <S(A|). The last term has also a gain which is of the order of the 
modeling error. This implies that the gain of \\9y\\ is small provided A 2 is small 
and «o is chosen to be large. 


Step 3. Use the B-G Lemma to establish boundedness. The normalizing prop- 
erties of m/ and 9 £ guarantee that y/mf,(j>/mf,m/mf G C 0 c . Because 

INI < M||y|| + ||A ro (s)|| 00 5||u|| + ||u|| 

it follows that ||y||/m/ G £ 00 • Due to the fact that A m (s) is proper and analytic in 
l?e[s] > — <5o/2, || A m (s)||oo <5 is a finite number provided 0 < S < So- Furthermore, 
IMI/ m / fs ^00 where 

Aoo = (9.3.53) 

S + dm oo g 

and, therefore, (9.3.52) may be written in the form 

\\9y\\ < —{\\9mf\\ + to/) + a 0 c(||eTOTO/|| + ||0m/|| + A^to/) (9.3.54) 

a 0 

Using (9.3.54) and ||iy||/m/ < cAoo, ||»?u||/m/ < cAoo in (9.3.50), we obtain 

To/ <l+c(^+alA 2 00 ')m 2 f +c\\gm. f \\ 2 
V CUn / 
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where g 2 = + an|em| 2 + ck§| 0| 2 and a 0 > 1. For c ( A + «nAM < 1, we have 

mj < c + c\\grrif\\ 2 = c + c f e _ ' 5 ^ _r ^ 2 (r)m 2 (r)dr 

Jo 

Applying the B-G Lemma III, we obtain 

m 2 < ce~ St e c fo° 2 ^ dT +c6 J\- S ^e c I* ~ 92{T)dT ds 

Because 9 , em £ <S(A|), it follows that 

C J g 2 (r)dT < cAl + «?) (t-s) + c 


Hence, for 


cAo + On < 6 


an 


(9.3.55) 


we have 

e - St e c fo §2(T)dT < e~ st 

where a = 5 — cA 2 + a^j which implies that m/ is bounded. The boundedness 
of m/ implies that all the other signals are bounded too. The constant S in (9.3.55) 
may be replaced by <5o since no restriction on 6 is imposed except that 5 £ (0,5o]- 
The constant c > 0 may be determined by following the calculations in each of the 
steps and is left as an exercise for the reader. 


Step 4. Obtain a bound for the regulation error y. The regulation error, i.e., 
y, is expressed in terms of signals that are guaranteed by the adaptive law to be of 
the order of the modeling error in m.s.s. This is achieved by using the Swapping 
Lemma A.l for the error equation (9.3.45) and the equation em 2 = — 9(f>+ri to obtain 
(9.3.46), which as shown before implies that y £ 5(A|). That is, the regulation 
error is of the order of the modeling error in m.s.s. 

The conditions that A m (s) has to satisfy for robust stability are summarized 
as follows: 


where 


-2 + ^ 
U.r\ 


< 1, 


cA 2 ( -^ + af, 
.“o 


< So 


A ^ = 


A m (s) 


oo<5o 


Ao = 


A m (s) 


2<5q 


The constant do > 0 is such that A m (s) is analytic in Re[s] > —do/2 and c denotes 
finite constants that can be calculated. The constant cco > max{l, do/2} is arbitrary 
and can be chosen to satisfy the above inequalities for small A 2 , A m . 
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Let us now simulate the above robust MRAC scheme summarized by the equa- 
tions 


u 

0 


m 


2 


-Qy 

a. a Z — 0(j> 

- a s jO, e = 

m z 

1 1 

— : y, z = y u 

s + dm s + a m 

1 + m s , m s = — S()m s + u 2 + y 2 , m s (0) = 0 


where a s is the switching tr, and applied to the plant 

y = — (1 + A m (s))u 
s — a 

where for simulation purposes we assume that a = 1 and A m (s) = — with 
fj, > 0. It is clear that for /i > 0 the plant is nonminimum phase. Figure 9.2 
shows the response of y(t) for different values of y that characterize the size of 
the perturbation A m (s). For small /z, we have boundedness and good regulation 
performance. As y increases, stability deteriorates and for y = 0.35 the plant 
becomes unstable. V 


General Case 

Let us now consider the SISO plant given by 

y p = G 0 (s)(l + A m (s))(u p + d u ) (9.3.56) 

where 

Go(s) = k p ^ (9.3.57) 

is the transfer function of the modeled part of the plant. The high frequency 
gain k p and the polynomials Z p (s), R p (s) satisfy assumptions PI to P4 given 
in Section 6.3 and the overall transfer function of the plant is strictly proper. 
The multiplicative uncertainty A m (s) satisfies the following assumptions: 

51. A m (s) is analytic in Re[s] > — 5o/2 for some known <5o > 0. 

52. There exists a strictly proper transfer function W (s) analytic in Re[s] > 

— 5q/ 2 and such that W(s)A m (s) is strictly proper. 



M- = 0.23 

M- = 0.2 
!_i = 0.1 

fj. = 0.05 

(j. = 0.0 


Figure 9.2 Simulation results of the MRAC scheme of Example 9.3.2 for 
different //. 

Assumptions SI and S2 imply that A,^, A 2 defined as 

A to = ||W'(s)A m (s)|| oo5o , A 2 = || W(s)A m (s)|| 25o 

are finite constants. We should note that the strict properness of the overall 
plant transfer function and of Gq(s) imply that Go(s)A m (s) is a strictly 
proper transfer function. 

The control objective is to choose u p and specify the bounds for Aoo , A 2 
so that all signals in the closed-loop plant are bounded and the output y p 
tracks, as close as possible, the output of the reference model y m given by 

rir / \ r Z m {s) 

ym = w m (s)r = k m r 
Rm{s) 

for any bounded reference signal r(t). The transfer function W m {s) of the 
reference model satisfies assumptions Ml and M2 given in Section 6.3. 

The design of the control input u p is based on the plant model with 
A m (s) = 0 and d u = 0. The control objective, however, has to be achieved 
for the plant with A m (s) / 0 and d u / 0. 
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We start with the control law developed in Section 6.5.3 for the plant 
model with A m (s) = 0, d u = 0, i.e., 

u p = 8 t lu (9.3.58) 

where 6 = [Oj , dj, # 3 , co] T , oj = , tof y p , r] T . The parameter vector 9 is 

to be generated on-line by an adaptive law. The signal vectors u \ , u >2 are 
generated, as in Section 6.5.3, by filtering the plant input u p and output y p . 
The control law (9.3.58) will be robust with respect to the plant uncertainties 
A m (s),ci u if we use robust adaptive laws from Chapter 8, instead of the 
adaptive laws used in Section 6.5.3, to update the controller parameters. 

The derivation of the robust adaptive laws is achieved by first developing 
the appropriate parametric models for the desired controller parameter vec- 
tor 9* and then choosing the appropriate robust adaptive law by employing 
the results of Chapter 8 as follows: 

We write the plant in the form 


RpVp = kpZ p {l + A m){u p + d u ) (9.3.59) 

and then use the matching equation 

(A - dfa)R p - k p (9fa + A 83 ) Z p = Z p A 0 R m (9.3.60) 

where a = a n - 2 (s ) = [s n ~ 2 , ■ ■ ■ , s, 1] T , (developed in Section 6.3 and given 
by Equation (6.3.12)) satisfied by the desired parameter vector 

9* = [ 8 f, 8 f ,e* 3 , c* 0 \ T 

to eliminate the unknown polynomials R p (s), Z p (s ) from the plant equation 
(9.3.59). From (9.3.59) we have 

(A - 8 fa)R p y p = (A - 9f a) k p Z p (l + A m )(u p + d u ) 


which together with (9.3.60) imply that 

Z p {k p {& 2 ^ ol + A#g) + A 0 R m )y p = (A — 8 \^ a)k p Z p { 1 + A m )(u p + d u ) 

Filtering each side with the stable filter -^r- and rearranging the terms, we 
obtain 

a *T a 


kp + 8 'ij y p + ^y p = kp U p-kpd 

A - Of a 


1 f^p 


+ k v 


A 


(A m { Up + d u ) + d u 
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or 


~^y p + O^jp—u^j— ——W^ypS (A m (u p + d u ) + d u ) 


Because Cg = jp it follows that 

Wm + Vp + 0 ^ Vp ~ Up ) = _c ° yp + Wm ^' no 
A - (9J T a 


(9.3.61) 


where 


Vo = 


A 


_ (A m (rtp -|- du) H- d u ) 


is the modeling error term due to the unknown A m , d u 
As in the ideal case, (9.3.61) can be written as 


w m (s)u p = e* T ct>p - 1 7 


(9.3.62) 


where 


e* = {ef,ef,0i c* 0 } T , ^ = [w m °^-u p , w m °^-y p , w mVp , Vp \ 


\ — 0*~^ ct 

V = W m (s)r jo = — W m (s)[(A m (u p + d u ) + d u )} 

Equation (9.3.62) is in the form of the linear parametric model (8.5.6) con- 
sidered in Chapter 8. 

Another convenient representation of (9.3.61) is obtained by adding the 
term c^y m = c^W m r on each side of (9.3.61) to obtain 


W m (0 1 ju p + e* 2 -■ y p + 6*3 1 2/ p + c* 0 r - u p ) = -c* 0 y p + c* 0 y m + W m (s)y 0 


or 

W m (0* T u ; - up) = -c* 0 e i + lE m ( S )? ?0 

leading to 

ei = W m (s)p*(u p - 0* T v + 77o) (9.3.63) 

where e\ = y p - y m , 


P = 3ir> w = 


a 


a 


T 


A u pi ^ Vpi Vpi r 
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which is in the form of the bilinear parametric model (8.5.15) considered in 
Chapter 8. 

Using (9.3.62) or (9.3.63), a wide class of robust MRAC schemes can be 
developed by simply picking up a robust adaptive law based on (9.3.62) or 
(9.3.63) from Chapter 8 and use it to update Oft) in the control law (9.3.58). 
Table 9.1 gives a summary of MRAC schemes whose robust adaptive laws 
are based on (9.3.62) or (9.3.63) and are presented in Tables 9.2 to 9.4. 

The robust adaptive laws of Tables 9.2 to 9.4 are based on the SPR- 
Lyapunov design approach and the gradient method with an instantaneous 
cost. Additional MRAC schemes may be constructed in exactly the same 
way using the least-squares method or the gradient method with an integral 
cost. The properties of this additional class of MRAC schemes are very 
similar to those presented in Tables 9.2 to 9.4 and can be established using 
exactly the same tools and procedure. 

The following theorem summarizes the stability properties of the MRAC 
scheme of Table 9.1 with robust adaptive laws given in Tables 9.2 to 9.4. 


Theorem 9.3.2 Consider the MRAC schemes of Table 9.1 designed for the 
plant model y p = Go(s)u p but applied to the plant (9.3.56) with nonzero plant 
uncertainties A m (s) and d u . If 

c + a o k A-loJ < 1> c + a o fc ^ (/o + Af) < - (9.3.64) 

where 


• Aj = A 02 and k = n* + 1 for the adaptive law of Table 9.2 

• A i = A 2 and k = n* for the adaptive laws of Tables 9.3, 9.4 


Aoo = ||lT(s)A m (s)|| oo5o 

\A{s)-9* 1 T a{s) 1 h 


Aq2 = 


Ao = 


AW 

A (s) — 9f T a(s) 


L "(A)— 

s + h 0 

4U m (s) A m (s) 

2Sq 


2<5q 


6 € (0,4'o) is such that G 0 x (s) is analytic in Re[s] > —5/2 
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• «o > max{l, <5o/2} is an arbitrary constant 

• ho > 5 q/ 2 is an arbitrary constant 

• c > 0 denotes finite constants that can be calculated and 

(a) fo = <7 in the case of fixed a -modification 

(b) fo = vo in the case of e-modification 

(c) fo = do in the case of dead zone modification 

(d) fo = 0 in the case of switching a -modification and projection 


Then all the signals in the closed-loop plant are bounded and the tracking 
error e\ satisfies 


1 

T 



e\dr < c( A 2 + dg + fo) + 


c 

T’ 


\/t > 0 


and for any T > 0, where do is an upper bound for \d u \ and A 2 = 1/ag + 
+ A 2 + Aq 2 for the scheme of Table 9.1 with the adaptive law given in 
Table 9.2 and A 2 = A 2 for the scheme of Table 9.1 with adaptive laws given 
in Tables 9.3, 9.f. 

If, in addition, the reference signal r is dominantly rich of order 2 n 
and Z p , R p are coprime, then the parameter error 9 and tracking error e\ 
converge to the residual set 


S = 




e IZ 2n ,e 1 € n 


\0\ + | ei | < c(/o + A + do) | 


where /o,A are as defined above. The convergence to the residual set S is 
exponential in the case of the scheme of Table 9.1 with the adaptive law given 
in Table 9.f. 


Outline of Proof The main tools and Lemmas as well as the steps for the 
proof of Theorem 9.3.2 are very similar to those in the proof of Theorem 6.5.1 
for the ideal case. A summary of the main steps of the proof are given below. 
The details are presented in Section 9.8. 

Step 1. Express the plant input and output in terms of the parame- 
ter error term 0 T (v. In the presence of unmodeled dynamics and bounded 
disturbances, the closed-loop MRAC scheme can be represented by the block 
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Table 9.1 Robust MRAC Schemes 


Actual plant 

Up = k PR p p \ s j (1 + A m (s))(u p + d u ) 

Plant model 

n. — U Z PU) y 

Up ~ R p (s) a P 

Reference 

model 

Urn = w m (s)r = k m r 

Control law 

chi = Fw! + gu v 

Co 2 = Fuj 2 + gy p 

Up = 9 T LU 

0 = [0i,0j,03 ,C 0 } T ,UJ = [uj ,ujJ ,y p ,r\ T 

u i €n n ~ 1 i i = l,2 

Adaptive law 

Any robust adaptive law from Tables 9.2 to 9.4 

Assumptions 

(i) Plant model and reference model satisfy assump- 
tions PI to P4, and Ml and M2 given in Section 
6.3.1 for the ideal case; (ii) A m (s) is analytic in 
Re[s] > —5q/2 for some known 5q > 0; (iii) overall 
plant transfer function is strictly proper 

Design 

variables 

W m , lF m 1 and where A(s) = det(s/ — F) are 

designed to be analytic in Re[s] > — 5q/2 


diagram shown in Figure 9.3. From Figure 9.3 and the matching equation 
(6.3.11), we can derive 


Up = W m (r + +Vy 

V c o / 

u p = G^Wm (r + \ 0 T uA + rj u 
V c o / 


(9.3.65) 
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Table 9.2 Robust adaptive laws based on the SPR-Lyapunov 
approach and bilinear model (9.3.63) 


Parametric model 
(9.3.63) 

ei = W m (s)p*(u p - 9* T tv + rjo) 

9 o = a 1 [A m(u p + d u ) + d u \ 

Filtered 

parametric model 

e/ = W m (s)L(s)p*(uf - 6* T cj) + rjf) 
e/ = 4°h 0 ei 

Vf = L o{s)r] 0 , Uf = L 0 (s)u p , $ = L 0 {s)u 

L 0 (s) = L- 1 (s) s § o ,ho>5 0 /2 

L(s) is chosen so that W m L is proper and SPR 
L(s),L -1 (s) are analytic in Re(s) > — ^ 

Normalized 
estimation error 

e = ef - if - W m Lenj,ef = W m Lp(uf - 0 T 4>) 
n 2 = m s , rh s = -5 0 m s + u 2 + y 2 ,m s (0) = 0 

Robust adaptive 
laws with 
(a) Leakage 

9 = Tecj)Sgn(p*) — w\T9, p = — ^27 p 

£ = Uf — 9 r cj) 

where w \ . W 2 are as defined in Table 8.1 and T = 

r T > 0,7 > 0 

(b) Dead zone 

9 = r^(e + g)sgn(p*), p = 7 f(e + s) 
where g is as defined in Table 8.7 

(c) Projection 

9 = Pr[r^esgn(p*)], p = Pr[7e£] 

where the operator Pr[-] is as defined in Table 8.6 

Properties 

v 2 

(i) e, en Sl 9, p <E £00, (H)e, en s , 9, p <E <S(/ 0 + ^>), 
where m 2 = 1 + n 2 and /o is a design parameter, 
depending on the choice of w\ , W 2 and g in (a) , (b) , 
and /o = 0 for (c) 
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Table 9.3 Robust adaptive laws based on the gradient method 
and bilinear model (9.3.63) 


Parametric model 
(9.3.63) 

ei = W m (s)p*(u p - 9* T tv + 770) 

9 0 = a 1 [A m (Up + d u ) + du] 

Parametric model 
rewritten 

ei = p*(uj - 9* T (j) + rj),Uf = W m u p , </> = W m u; 

V = A 1 W m [A m (u p + d u ) + d u ] 

Normalized 

estimation error 

e = i = u f 9 t 4>, m 2 = 1 + n 2 

n 2 = m s , m s = -5 0 m s + u 2 + y 2 ,m s (0) = 0 

Robust 
adaptive laws 

Same expressions as in Table 9.2 but with 
e, (j>,£,Uf as given above 

Properties 

(i) e, en s , 9, p, 9, p e Too; (h) e, en s , 9, 
v 2 

pG«S(/o + ^), where /o is as defined in Table 
9.2 


where 


Vy = 


A -C\ 


D* 

^T m \A m {Up + d u ) + d u ], Tj u = ^ ^ W m \A m (Up T d u ) + d u ] 


where Cf(s) = 9f r a(s), D\ = 9\ T a(s) + #|A(s). Using the properties of the 
£2 <5 norm || (•)* II 25 ? which for simplicity we denote as || (•)]], and the stability 
of W m , G 0 1 , it follows that there exists 6 € (0, <5o] such that 


WllpW < c + c\\9 t uj\\ + \\rj y 
Kll < c + c\\9 T uj\\ + \\i] u 


The constant 5 > 0 is such that G 0 1 (s) is analytic in Refs] > —5/2. There- 
fore, the fictitious normalizing signal m 2 = 1 + ||u p || 2 + ||y p || 2 satisfies 
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Table 9.4 Robust adaptive laws based on the linear model 

(9.3.62) 


Parametric 

model 

z = 0* T (j) p 17,77 = A x W m [A m (u p + d u ) + d u \ 

Z = 1 VmUp, (j)p — [Wm^/^Up, Wm~\~ypi ^PmUpi Up]^ 

Normalized 

estimation 

error 

e=^4, z = e^^,e = [eJ,eJ,9 3 ,co} T 

m 2 = 1 + m s , rh s = -5om s + u 2 + y%,m 8 ( 0 ) = 0 

Constraint 

B(0) = c 0 — CQsgn(^r-) < 0 for some c 0 > 0 
satisfying 0 < c 0 < Cq 

Projection 

operator 

r / if |co| > C 0 

Pr[/]=< °r if Co = c 0 and f T VB < 0 

V (v B) 1 rvB-f otherwise 

VB = -[0, ... ,0, 1 \ T sgn(k p /k m ) 

r = r T > o 

Robust 
adaptive law 

9 = Pr '[/], c 0 (0) > Cq 

(a) Leakage 

(b) Dead zone 

(c) Projection 

f = Te(j)p — wTO, w as given in Table 8.1 

/ = T(j>p{e + g),g as given in Table 8.4 

(Br[Tect>p\ if 0 < M 0 

0=1 or \6 \ = Mq and (Pr[r ) T 6 ? < 0 

\(I - |rf^)Pr[Te 0 p] otherwise 
where M 0 > |0(O)| < Mo and Pr[-] is the projec- 

tion operator defined above 

Properties 

• • 2 
(i) e,en s ,0,6 £ Too! (h) e,en s ,0 £ <S (/ 0 + ^), where 

/o is as defined in Table 9.2 
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0 u) t ] i — A m {up du) d u 



Figure 9.3 The closed-loop MRAC scheme in the presence of unmodeled 
dynamics and bounded disturbances. 

Step 2. Use the swapping lemmas and properties of the C 2 s norm to 
bound ||0 T u;|| from above with terms that are guaranteed by the robust adap- 
tive laws to have small in m.s.s. gains. In this step we use the Swapping 
Lemma A.l and A. 2 and the properties of the £ 25 ~ norm to obtain the ex- 
pression 

||(? T w|| < c\\gmA\ + c(— + agAoo )mt + cd 0 (9.3.67) 

«o 

where «o > nraxjl, 5 q /2} is arbitrary and g £ 5(/o + A? + with A j = 
Ao 2 ; k = n* + 1 in the case of the adaptive law of Table 9.2, A j = A 2 , k = n* 
in the case of the adaptive laws of Tables 9.3 and 9.4, and do is an upper 
bound for \d u \. 

Step 3. Use the B-G Lemma to establish boundedness. Using (9.3.67) 
in (9.3.66) we obtain 

mj < c + c\\gm, f \\ 2 + c ( + a^A^ ) m 2 + cd 2 0 

\ a o / 

We choose «o large enough so that for small Aoo 

c (ff 2 +a o* A «) < 1 
\ ^0 / 
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We then have m 2 < c + c|| / 1| 2 for some constants c > 0, which implies 
that 

rt 

m 2 <c + cj e~ s ^~ T ^g 2 (T)m 2 f(T)dT 
Applying the B-G Lemma III, we obtain 

m) < ce~ St e c fo g2 ^ dT + c<5 J * e~ s ^e c f! 92{r)dT ds 

Because 

[ g 2 {r)dT < c(/ 0 + A 2 )(t - s) + c [ —\dr + c 
J s J s TYl 

Vf > s > 0, it follows that for c(/o + A?) < (5/2 we have 

(' dl/m 2 )dr + C( j f* e -^(ts) e cJ t s {dl/m 2 )dT^ s 

Jo 

d 2 

The boundedness of m/ follows directly if we establish that c m2 °^ < 5/2 , 

Vf > 0. This condition is satisfied if we design the signal as n 2 = /?o + m s 

d 2 d 2 

where /3q is a constant chosen large enough to guarantee c m2 0 ^ < c < 
5/2, Vi > 0. This approach guarantees that the normalizing signal m 2 = 
1 + n 2 is much larger than the level of the disturbance all the time. Such a 
large normalizing signal may slow down the speed of adaptation and, in fact, 
improve robustness. It may, however, have an adverse effect on transient 
performance. 

The boundedness of all signals can be established, without having to 
modify n 2 , by using the properties of the C 2 s norm over an arbitrary interval 
of time. Considering the arbitrary interval [ti,t) for any t\ > 0 we can 
establish by following a similar procedure as in Steps 1 and 2 the inequality 

m 2 {t) < c(l + m 2 (ii))e _ 2 ^ _ h) e c /t 1 

+c5 / e“5 (t " s) e A mTP ds,\/t >h>0 (9.3.68) 

Jti 

We assume that m 2 (f) goes unbounded. Then for any given large number 
a > 0 there exists constants £2 > ii > 0 such that m 2 (ti) = a, m 2 (t 2 ) > 
/i(a), where fi(a) is any static function satisfying /i(d) > a. Using the fact 
that m 2 cannot grow or decay faster than an exponential, we can choose f\ 


9 9_+ c 

rri f < ce A e 


r 

Jo 
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properly so that m 2 (t) > a Vi € [i 1 , ^ 2 ] for some t\> a where t 2 — t\ > a. 
Choosing a large enough so that d^/a < (5/2, it follows from (9.3.68) that 

m 2 (t 2 ) < c(l + a)e~^ a e cd o + c 

We can now choose a large enough so that m 2 (t- 2 ) < ct which contradicts 
with the hypothesis that m 2 (t 2 ) > a and therefore m 6 /loo- Because m 
bounds u p ,y p ,LU from above, we conclude that all signals are bounded. 

Step 4. Establish bounds for the tracking error e\. Bounds for ei in 
rn.s.s. are established by relating e\ with the signals that are guaranteed by 
the adaptive law to be of the order of the modeling error in rn.s.s. 

Step 5. Establish convergence of estimated parameter and tracking error 
to residual sets. Parameter convergence is established by expressing the pa- 
rameter and tracking error equations as a linear system whose homogeneous 
part is e.s. and whose input is bounded. 

The details of the algebra and calculations involved in Steps 1 to 5 are 
presented in Section 9.8. 


Remark 9.3.2 Effects of initial conditions. The results of Theorem 9.3.2 
are established using a transfer function representation for the plant. 
Because the transfer function is defined for zero initial conditions the 
results of Theorem 9.3.2 are valid provided the initial conditions of 
the state space plant representation are equal to zero. For nonzero 
initial conditions the same steps as in the proof of Theorem 9.3.2 can 
be followed to establish that 

rt 

m 2 {t) <c + cpo + cpoe~ 5t + c J e -' 5 P-' r ) g 2 (r)m 2 (T)<iT 

where po > 0 depends on the initial conditions. Applying the B-G 
Lemma III, we obtain 


m 2 (t) < (c + cpo)e 5t e c $o 3 < - s ' >ds + 5(c + cpo) J e S( ' t T ^e c ^ 9 ^ ds dr 


where g € <S(/o + A? + c^/m 2 ) and /o,Aj are as defined in Theo- 
rem 9.3.2. Therefore the robustness bounds, obtained for zero initial 
conditions, will not be affected by the non-zero initial conditions. The 
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bounds for rrif and tracking error ei, however, will be affected by the 
size of the initial conditions. 

Remark 9.3.3 Robustness without dynamic normalization. The results of 
Theorem 9.3.2 are based on the use of a dynamic normalizing signal 
m s = n 2 so that m = \fl + n 2 s bounds both the signal vector <f> and 
modeling error term r/ from above. The question is whether the signal 
m s is necessary for the results of Theorem 9.3.2 to hold. In [168, 240], 
it was shown that if m is chosen as m 2 = 1 + i.e., the same 

normalization used in the ideal case, then the projection modification 
alone is sufficient to obtain the same qualitative results as those of 
Theorem 9.3.2. The proof of these results is based on arguments over 
intervals of time, an approach that was also used in some of the original 
results on robustness with respect to bounded disturbances [48]. The 
extension of these results to modifications other than projection is not 
yet clear. 

Remark 9.3.4 Calculation of robustness bounds. The calculation of the 
constants c, 6, AjjAoo is tedious but possible as shown in [221, 227]. 
These constants depend on the properties of various transfer func- 
tions, namely their H 00 $,H 28 bounds and stability margins, and on 
the bounds for the estimated parameters. The bounds for the esti- 
mated parameters can be calculated from the Lyapunov-like functions 
which are used to analyze the adaptive laws. In the case of projection, 
the bounds for the estimated parameters are known a priori. Because 
the constants c, AjjAoo,^ depend on unknown transfer functions and 
parameters such as Gq(s),Gq 1 (s),0* , the conditions for robust stabil- 
ity are quite difficult to check for a given plant. The importance of the 
robustness bounds is therefore more qualitative than quantitative, and 
this is one of the reasons we did not explicitly specify every constant 
in the expression of the bounds. 

Remark 9.3.5 Existence and uniqueness of solutions. Equations (9.3.65) 
together with the adaptive laws for generating 0 = 6 + 6* used to 
establish the results of Theorem 9.3.2 are nonlinear time varying equa- 
tions. The proof of Theorem 9.3.2 is based on the implicit assumption 
that these equations have a unique solution Vi G [0, oo). Without 
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this assumption, most of the stability arguments used in the proof of 
Theorem 9.3.2 are not valid. The problem of existence and unique- 
ness of solutions for a class of nonlinear equations, including those of 
Theorem 9.3.2 has been addressed in [191]. It is shown that the sta- 
bility properties of a wide class of adaptive schemes do possess unique 
solutions provided the adaptive law contains no discontinuous modifi- 
cations, such as switching a and dead zone with discontinuities. An 
exception is the projection which makes the adaptive law discontinuous 
but does not affect the existence and uniqueness of solutions. 

The condition for robust stability given by (9.3.64) also indicates that the 
design parameter fo has to satisfy certain bounds. In the case of switching a 
and projection, fo = 0, and therefore (9.3.64) doesnot impose any restriction 
on the design parameters of these modifications. For modifications, such as 
the e-modification, fixed a, and dead zone, the design parameters have to 
be chosen small enough to satisfy (9.3.64). Because (9.3.64) depends on 
unknown constants, the design of fo can only be achieved by trial and error. 

9.3.4 Robust Indirect MRAC 

The indirect MRAC schemes developed and analyzed in Chapter 6 suffer 
from the same nonrobust problems the direct schemes do. Their robustifica- 
tion is achieved by using, as in the case of direct MRAC, the robust adaptive 
laws developed in Chapter 8 for on-line parameter estimation. 

In the case of indirect MRAC with unnornralized adaptive laws, robusti- 
fication leads to semiglobal stability in the presence of unmodeled high fre- 
quency dynamics. The analysis is the same as in the case of direct MRAC 
with unnornralized adaptive laws and is left as an exercise for the reader. 
The failure to establish global results in the case of MRAC with robust but 
unnornralized adaptive laws is due to the lack of an appropriate normaliz- 
ing signal that could be used to bound from above the effect of dynamic 
uncertainties. 

In the case of indirect MRAC with normalized adaptive laws, global 
stability is possible in the presence of a wide class of unmodeled dynamics 
by using robust adaptive laws with dynamic normalization as has been done 
in the case of direct MRAC in Section 9.3.3. 
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We illustrate the robustification of an indirect MRAC with normalized 
adaptive laws using the following example: 

Example 9.3.3 Consider the MRAC problem for the following plant: 

Up = — (1 + A m (s))u p (9.3.69) 

s — a 

where A m is a proper transfer function and analytic in Re[s] > — <jg/2 for some 
known (5 0 > 0, and a and b are unknown constants. The reference model is given by 

Vm = — : r, a m > 0 

s + a m 

If we assume A m (s) = 0, the following simple indirect MRAC scheme can be used 
to meet the control objective: 


Up — 

-k(t)y p + l{t)r 

(9.3.70) 

k{t) = 

CLm ~\~ CL ^ bra 

S’ ~ b 


. ( 7ie^i, 

if 6 > b 0 or 


a = 7 2 e0 2) 6 = < 

if S = b 0 and e0isgn(6) > 0 

(9.3.71) 

{ o 

otherwise 



where 


m 2 = 1 + 0? + x2 


lTft, 9i — — ~ U P’ — Up 

S ~r A 


s A 


z — z 


€ = 


z = 


s H - A 


Vp , z=b(j) 1 + a<j)2 


6(0) > 6o, &o is a known lower bound for |6| and A > 0 is a design constant. When 
(9.3.70) and (9.3.71) are applied to the plant (9.3.69), the ideal properties, such as 
stability and asymptotic tracking, can no longer be guaranteed when A m (s) ^ 0. 
As in the case of the direct MRAC, the boundedness of the signals can be lost due 
to the presence of unmodeled dynamics. 

The indirect MRAC scheme described by (9.3.70) and (9.3.71) can be made 
robust by using the techniques developed in Chapter 8. For example, instead of the 
adaptive law (9.3.71), we use the robust adaptive law 


a 

b 


7 2 e02 - cr s 7ia 

{ 7ie^i - 7icr s 6, 

0 


if |6| > b 0 or 

if |6| = bo and (e(f > i — a s b)sgn(b ) > 0 (9.3.72) 
otherwise 
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where 


2 — 2 

m2 7 


2 = b(j> i + af 2 


1 


n = 
2 


<f> 2 = 


1 


s + A“ F ’ ~ s + X Vp ’ *~s + A 

m^l + nf, n 2 s = m s , m s = -(5 0 m s + + y 2 , 


y P 


m s (0) = 0 

The design constants A and a TO are chosen as A > 5o/2,a m > <5o/2, cr s is the 
switching cr-modification as defined in Chapter 8, bo is a lower bound satisfying 
0 < bo < |6|. The above robust adaptive law is developed using the parametric 
model 

2 = 9* T (j) + T] 

for the plant (9.3.69) where 9* = [b,a\ T , 


1 


1 


s + A y s + A 


:Up 


= [<t> 1 , 02 ] , V = 


bA-m(s) 
s T A 


As we have shown in Chapter 8, the adaptive law (9.3.72) guarantees that 

(i) e, em, a, b,a,b £ 

(ii) e,en s ,a,b£ <S(A|), where A 2 = ||^^|| 2 « 0 


V 


Let us now apply the MRAC scheme given by the control law (9.3.70) 
and robust adaptive law (9.3.72) to the plant (9.3.69). 

Theorem 9.3.3 The closed-loop indirect MRAC scheme given by (9.3.69), 
(9.3.70), and (9.3.72) has the following properties: If r,r 6 and the 
plant uncertainty A m (s) satisfies the inequalities 

C -.2 i „„ 2 a 2 , „ a 2 ^ ^0 


~2 + C ^oo + ca 0^X < 1) c ^2 ^ 7T 


Q(n 


where 

Aoc = ||R / TO (s)A m (s)|| oo 5 0 , A 2 = 


A m(s) 


s T A 


25 0 


, Aa = 


A 


s T A 


oo5q 


where ao > do is an arbitrary constant and c > 0 denotes any finite constant, 
then all signals are bounded and the tracking error e\ satisfies 

rt+T 

/ e 2 dr < cA 2 + c 


for all t > 0 and any T > 0. 
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Proof As in the direct case, the proof may be completed by using the following 
steps: 

Step 1. Express y p , u p in terms of the plant parameter error a = a— a, b = b—b. 
First we write the control law as 

u p = —ky p + Ir = —k*y p + l* r — ky p + Ir (9.3.73) 

where k* = ( a m + a)/b , T = b m /b. Because k = a/b — kb/b and l = —Ib/b, (9.3.73) 
can also be written as 

u p = —k*y p + l*r — ~^(ay p + bu p ) (9.3.74) 

by using the identity —ky p + Ir = u p . Using (9.3.74) in (9.3.69), we obtain 

y P = W m (s) 9 t uj p + m (s)u^j (9.3.75) 

where u> p = [up,y p ] T and 6 = [5, a] T . 

Equation (9.3.74) may be also written in the compact form 

Up = —k*y p + l*r — j6 T u>p (9.3.76) 

Using the properties of the C -25 norm which for simplicity we denote by ||(-)||, we 
have 

IMI < C + c\\e T oj p \\ + 1^1 ||W m (s)A m (s)||oo (5 ||up|| 

IKII < c + c\\y p \\ + c\\e T UpW 
which imply that 

IKII, lll/pll < c + c\\e T aj p \\ + cAoollupH 

where Aoo = ||M / m (s)A m (s)|| 00 5 for some 8 € (0, Jo] - The fictitious normalizing 
signal m 2 = 1 + ||w p || 2 + \\y p \\ 2 satisfies 

inj <c + c\\e T Lo p \\ 2 + cA 2 OQ m 2 f (9.3.77) 


Step 2. Obtain an upper bound for ||Fw p | in terms of signals that are guar- 
anteed by the adaptive law to have small in m.s.s. gains. We use the Swapping 
Lemma A. 2 to express 6'u> p as 


1 - 
1 


a 0 


S ~r Q!q 
lT 


Ft 


OL 0 


5T, 


-{9 ujp + 9 LOp) + 


1/ UJp 

S + Qq 

q:o(^ H - A) 1 


-0 UJn 


s + a 0 


s H- c^o s H- A 


(9.3.78) 
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where ao > So is arbitrary. From Swapping Lemma A.l, we have 


1 

s T A 


= e T c 


i 

s t A 


<yo 


where (f) = From the estimation error equation 


em 2 = z — z = — 6 ' <j> + y 


we have 


d T (j) = — em 2 + y 

and, therefore, (9.3.79) may be expressed as 

^ t lu p = — em 2 + y -<^ T d 


s T A 

Substituting (9.3.80) in (9.3.78), we obtain 

rr 1 


s + a 0 


[0 


IT,-. 1 , a o(s + A) 

'p t i/ u/p I “I ; 

5 + a 0 


;t " 1 e-w. 


s T A 


—em + y — 


s T A 


eye 


(9.3.79) 


(9.3.80) 


(9.3.81) 


The signal mj bounds from above |w p |, ||w p ||,m, This can be shown as 

follows: Because the adaptive law guarantees that 9 £ Coo, it follows from (9.3.75) 
that 

\y p {t)\ < c T c 1 1 ujp 1 1 + cAi||u p || < c+c\\u p \\ + c\\y p \\ 

where Ai = ||W TO (s)A TO (s)|| 25 , and, therefore, y p /mf £ Coo- From u p = —ky p + Ir 
and k,l £ C oo, it follows that u p /mf £ Coo and, therefore, u> p /mf £ Coo- Using 
(9.3.75) we have 


lls/pll < ||sW m (s)||oo<5(c + c||w p ||) + c||sM / m (s)A m (s)|| 00 5 ||Up|| 

< c + c||w p || + c||u p || < c + cm/ 

Therefore, ||y p ||/m/ £ Coo- Now ii p = —ky p + ky p + lr + If. Because k, l G Coo and 
by assumption r, f £ Coo, it follows from \y p \/mf £ Coo and ||y p ||/rn/ G Coo that 
\\u P \\/m.f £ Coo- From ||Ap|| < ||w p || + ||y p ||, it follows that ||A p ||/m/ G Coo- The 
boundedness of m/?n/, (f>/mf follows in a similar manner by using the properties of 
the C 25 norm. 

Using the normalizing properties of m/ and the properties of the C 26 norm, we 
obtain from (9.3.81) the following inequality: 

ll^pll < -&i\\0mf\\ + m f ) + a 0 (\\emm f \\ + A x m f + \\6m f \\ ) (9.3.82) 

ao 

where for the ||?y|| term we used the inequality 


IMI < A A ||it p || < cA x m f 
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where = ||^^||oo< 5 - The “smallness” of ||0 T w p || follows from 9, an G <S(A|) 
and by choosing «o large. 

Step 3. Use the B-G Lemma to establish boundedness. Using (9.3.82) in 
(9.3.77), we obtain 

m 2 < c+ c\\gmf\\' 2 H — ^ m 2 + ca 2 0 A \m 2 + cA^ra 2 
a 0 

where g 2 = (-^ + ag)|6*| 2 + ck§ | e?7i. | 2 , i.e. , g G S( A|). Therefore, for 


+ ca o^\ + cN'L, < 1 


a 


we have 


or 


m 2 < c+c\\gm f \Y 


ds 


m 2 (t)<c + c f e s( - t r) 5 2 (r)m^(r)dr 
Jo 

Applying the B-G Lemma III, we obtain 

m}(t) < ce- 6t e c fo s2(T)dT + cS f* e ~ s ^e c fs 

Jo 

Because g G ^(Aj), i.e., f* g 2 (r)dr < cA 2 (t — s) + c, it follows that for cA 2 < S, 
m 2 f G Coo which implies that all signals are bounded. 

Step 4. Error bounds for the tracking error. Using (9.3.75), the tracking error 
e-[ = y p - Urn is given by 


1 


ei = 


s + a r 


-e T i 


s T A 
s + a 7 


-V = 


s T A 
s + a m 


s T A 


u) p ~r] 


Using (9.3.80), we have 


ei = 


s T A 
s T u rn 


em 


s T A 


(f l e 


(9.3.83) 


Because an, 6 G S(A|) and </>, m G C^, it follows from (9.3.83) and Corollary 3.3.3 
that ei G <S(A 2 ), i.e., 

rt-\-T 

/ e\dr < cA\T + c 

Vf > 0 and any T > 0. □ 


The extension of this example to the general case follows from the mate- 
rial presented in Section 9.3.3 for the direct case and that in Chapter 6 for 
indirect MRAC and is left as an exercise for the reader. 
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9.4 Performance Improvement of MRAC 


In Chapter 6 we have established that under certain assumptions on the 
plant and reference model, we can design MRAC schemes that guarantee 
signal boundedness and asymptotic convergence of the tracking error to 
zero. These results, however, provide little information about the rate of 
convergence and the behavior of the tracking error during the initial stages 
of adaptation. Of course if the reference signal is sufficiently rich we have 
exponential convergence and therefore more information about the asymp- 
totic and transient behavior of the scheme can be inferred. Because in most 
situations we are not able to use sufficiently rich reference inputs without 
violating the tracking objective, the transient and asymptotic properties of 
the MRAC schemes in the absence of rich signals are very crucial. 

The robustness modifications, introduced in Chapter 8 and used in the 
previous sections for robustness improvement of MRAC, provide no guaran- 
tees of transient and asymptotic performance improvement. For example, 
in the absence of dominantly rich input signals, the robust MRAC schemes 
with normalized adaptive laws guarantee signal boundedness for any finite 
initial conditions, and a tracking error that is of the order of the modeling 
error in rn.s.s. Because smallness in rn.s.s. does not imply smallness point- 
wise in time, the possibility of having tracking errors that are much larger 
than the order of the modeling error over short time intervals at steady state 
cannot be excluded. A phenomenon known as “bursting,” where the track- 
ing error, after reaching a steady-state behavior, bursts into oscillations of 
large amplitude over short intervals of time, have often been observed in 
simulations. Bursting cannot be excluded by the rn.s.s. bounds obtained in 
the previous sections unless the reference signal is dominantly rich and/or 
an adaptive law with a dead zone is employed. Bursting is one of the most 
annoying phenomena in adaptive control and can take place even in sim- 
ulations of some of the ideal MRAC schemes of Chapter 6. The cause of 
bursting in this case could be the computational error which acts as a small 
bounded disturbance. There is a significant number of research results on 
bursting and other undesirable phenomena, mainly for discrete-time plants 
[2, 68, 81, 136, 203, 239]. We use the following example to explain one of 
the main mechanisms of bursting. 
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Example 9.4.1 (Bursting) Let us consider the following MRAC scheme: 

Plant x = ax + bu + d 

Reference Model x rn = — x m + r, x m (0) = 0 

Adaptive Controller 

u=e T uj, 9 = [0 O , c 0 ] T , u = [x,r\ T 
9 = Pr[— eiwsgn(6)] , e\ = x — x m (9.4.1) 


The projection operator in (9.4.1) constrains 9 to lie inside a bounded set, where 
|0| < M 0 and Mo > 0 is large enough to satisfy \9*\ < M 0 where 9* = [—^4, ^] T 
is the desired controller parameter vector. The input d in the plant equation is an 
arbitrary unknown bounded disturbance. Let us assume that |d(t)| < do,Vt P 0 for 
some do > 0. 

If we use the analysis of the previous sections, we can establish that all signals 
are bounded and the tracking error ei £ 5(dg) , i.e. , 


f e\dr < dg(t — to) + ko, Vt > to > 0 
Jto 


(9.4.2) 


where fco depends on initial conditions. Furthermore, if r is dominantly rich, then 
ei, 9 = 9 — 9* converge exponentially to the residual set 


So = 


jei, 9 |ei| + \9\ < cd 0 j 


Let us consider the tracking error equation 

ei — — €\ -\~ bu uj T d 


(9.4.3) 


and choose the following disturbance 

d = h(t)sat{—b(9 — 9 *) t uj} 


where h(t) is a square wave of period 1007T sec and amplitude 1, 


sat{x} = 


x if \x\ < d 0 
do if x > do 
—do if x < —do 


and 9 is an arbitrary constant vector such that \9\ < Mo- It is clear that |d(t)| < do 
Vt > 0. Let us consider the case where r is sufficiently rich but not dominantly 
rich, i.e., r(t) has at least one frequency but |r| <C do- Consider a time interval 
[fi,fi+7i] over which |ei(t)| < d 0 . Such an interval not only exists but is also large 
due to the uniform continuity of ei(t) and the inequality (9.4.2). Because |ei| < do 
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and 0 < |r| <C do, we could have \b(0 — 9*) t lu \ < do for some values of \9\ < Mo 
over a large interval [t 2 ,t 2 + T 2 ] C [t\,t\ + T). Therefore, for t, e \t 2 R 2 + T 2 ] we 
have d = —h{t)b{9 — 9*) T u> and equation (9.4.3) becomes 

ei = — ei + b(9-9) T co Vf e [< 2 , h + T 2 \ (9.4.4) 

where u> = h(t)u>. Because r is sufficiently rich, we can establish that u> is PE and 
therefore 9 converges exponentially towards 9. If T 2 is large, then 9 will get very 
close to 9 as t — > t 2 + T 2 . If we now choose 9 to be a destabilizing gain or a gain 
that causes a large mismatch between the closed-loop plant and the reference model, 
then as 9 — > 9 the tracking error will start increasing, exceeding the bound of do- 
In this case d will reach the saturation bound do and equation (9.4.4) will no longer 
hold. Since (9.4.2) does not allow large intervals of time over which |ei| > do, we 
will soon have \e\\ < do and the same phenomenon will be repeated again. 

The simulation results of the above scheme for a = 1, b = 1 , r = O.lsinO.Olf, 
do = 0.5, M 0 = 10 are given in Figures 9.4 and 9.5. In Figure 9.4, a stabilizing 
9 = [—3, 4] t is used and therefore no bursting occurred. The result with 9 = [0, 4] T , 
where 9 corresponds to a destabilizing controller parameter, is shown in Figure 9.5. 
The tracking error e\ and parameter 9i(t), the first element of 9, are plotted as 
functions of time. Note that in both cases, the controller parameter 9\(t) converges 
to 9 1 , i.e., to —3 (a stabilizing gain) in Figure 9.4, and to 0 (a destabilizing gain) 
in Figure 9.5 over the period where d = — h(t)b(9 — 9*) T ijj. The value of 0 2 = 4 is 
larger than 9^ = 1 and is responsible for some of the nonzero values of e\ at steady 
state shown in Figures 9.4 and 9.5. V 

Bursting is not the only phenomenon of bad behavior of robust MRAC. 
Other phenomena such as chaos, bifurcation and large transient oscillations 
could also be present without violating the boundedness results and rn.s.s. 
bounds developed in the previous sections [68, 81, 136, 239]. 

One way to eliminate most of the undesirable phenomena in MRAC is to 
use reference input signals that are dominantly rich. These signals guarantee 
a high level of excitation relative to the level of the modeling error, that in 
turn guarantees exponential convergence of the tracking and parameter error 
to residual sets whose size is of the order of the modeling error. The use 
of dominantly rich reference input signals is not always possible especially 
in the case of regulation or tracking of signals that are not rich. Therefore, 
by forcing the reference signal to be dominantly rich, we eliminate bursting 
and other undesirable phenomena at the expense of destroying the tracking 
properties of the scheme in the case where the desired reference signal is not 
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Figure 9.4 Simulation results for Example 9.4.1: No bursting because of 
the stabilizing 0 = [ — 3, 4] T . 
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Figure 9.5 Simulation results for Example 9.4.1: Bursting because of the 
destabilizing 6 = [0, 4] T . 
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rich. Another suggested method for eliminating bursting is to use adaptive 
laws with dead zones. Such adaptive laws guarantee convergence of the 
estimated parameters to constant values despite the presence of modeling 
error provided of course the size of the dead zone is higher than the level 
of the modeling error. The use of dead zones, however, does not guarantee 
good transient performance or zero tracking error in the absence of modeling 
errors. 

In an effort to improve the transient and steady-state performance of 
MRAC, a high gain scheme was proposed in [148], whose gains are switched 
from one value to another based on a certain rule, that guarantees arbi- 
trarily good transient and steady state tracking performance. The scheme 
does not employ any of the on-line parameter estimators developed in this 
book. The improvement in performance is achieved by modifying the MRC 
objective to one of “approximate tracking.” As a result, non-zero tracking 
errors remained at steady state. Eventhough the robustness properties of the 
scheme of [148] are not analyzed, the high-gain nature of the scheme is ex- 
pected to introduce significant trade-offs between stability and performance 
in the presence of unmodeled dynamics. 

In the following sections, we propose several modified MRAC schemes 
that guarantee reduction of the size of bursts and an improved steady-state 
tracking error performance. 

9.4.1 Modified MRAC with Unnormalized Adaptive Laws 

The MRAC schemes of Chapter 6 and of the previous sections are designed 
using the certainty equivalence approach to combine a control law, that 
works in the case of known parameters, with an adaptive law that provides 
on-line parameter estimates to the controller. The design of the control law 
does not take into account the fact that the parameters are unknown, but 
blindly considers the parameter estimates provided by the adaptive laws to 
be the true parameters. In this section we take a slightly different approach. 
We modify the control law design to one that takes into account the fact 
that the plant parameters are not exactly known and reduces the effect of 
the parametric uncertainty on stability and performance as much as possi- 
ble. This control law, which is robust with respect to parametric uncertainty, 
can then be combined with an adaptive law to enhance stability and per- 
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formance. We illustrate this design methodology using the same plant and 
control objective as in Example 9.4.1, i.e., 

Plant x = ax + bu + d 

Reference Model x m = — x m + r, x m (0) = 0 

Let us choose a control law that employs no adaptation and meets the 
control objective of stability and tracking as close as possible even though 
the plant parameters a and b are unknown. 

We consider the control law 

u = 9qx + cor + u a (9.4.5) 

where Oq, co are constants that depend on some nominal known values of 
a, b if available, and u Q is an auxiliary input to be chosen. With the input 
(9.4.5), the closed- loop plant becomes 

x = — x + b H ^ x + bcor + bu a + d (9.4.6) 

and the tracking error equation is given by 

ei = — ei + b6 0 ei + b0 o x m + bc 0 r + bu a + d (9.4.7) 

where 6q = 0q + Co = Co — l are the constant parameter errors. Let us 
now choose 

5 H - 1 

u a = sgn(6)ei (9.4.8) 

T-S 

where r > 0 is a small design constant. The closed-loop error equation 
becomes 

T S — ~ 

e\ = E [b0 o x m + bc 0 r + d] (9.4.9) 

ts 2 + (r + |6| - rb6 0 )s + \b\ 

If we now choose r to satisfy 

0 < r < — (9.4.10) 

\0o\ 

the closed-loop tracking error transfer function is stable which implies that 
ei, ^ei G Coo and therefore all signals are bounded. 
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Another expression for the tracking error obtained using x 
and (9.4.7) is 

TS — ~ 

ei = ( i rw ~ — + b ^° r + 

(s + l)(rs + | 6 |) 

or for | 6 | /rwe have 


A x m 

(9.4.11) 


ei 


\b\ - r 


\b\ 


ts + \b\ 


1 

s + 1 


(w + d) 


(9.4.12) 


where w — b(6ox + cqt) is due to the parametric uncertainty. Because 
x G Coo, for any given r G (0, 1/ 1 (9o | ) , we can establish that there exists a 
constant wq > 0 independent of r such that sup t \w(t)\ < wq. It, therefore, 
follows from (9.4.12) that 

M*)| < e - e-‘) | ei (0)| + j^(wo + d 0 ) (9.4.13) 


where do is the upper bound for \d(t)\ < do,yt > 0. It is, therefore, clear 
that if we use the modified control law 

— s + 1 

u = Oqx + cor sgn( 6 )ei 

TS 


with 


0 < r < min 




(9.4.14) 


then the tracking error will converge exponentially fast to the residual set 


Se 



ei | < 


2 r 

\b\ - t 


(w 0 + d 0 ) 


(9.4.15) 


whose size reduces to zero as r — ► 0 . 

The significance of the above control law is that no matter how we choose 
the finite gains 0q, Co, there always exist a range of nonzero design parameter 
values r for stability. Of course the further 0q is away from the desired 
00 where 0 q = — the smaller the set of values of r for stability as 
indicated by (9.4.14). Even though the tracking performance of this modified 
control law can be arbitrarily improved by making r arbitrarily small, we 
cannot guarantee that the tracking error converges to zero as t — » oo even 
when the disturbance d = 0. This is because the parameter error 9q,co 
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is nonzero and acts as a disturbance in the tracking error equation. The 
asymptotic convergence of the tracking error to zero in the case of d = 0 can 
be reestablished if, instead of using the constant gains 9o,cq in (9.4.5), we 
use an adaptive law to generate on-line estimates for the controller gains. 
Therefore, instead of combining (9.4.5) with u a = 0 with an adaptive law as 
we did in Chapter 6, we combine the modified control law 

g -U 1 

u = 6o(t)x + co{t)r sgn(6)ei (9.4.16) 

TS 

with an adaptive law that generates 9o{t),co(t), the estimates of 9q,Cq re- 
spectively. With (9.4.16) the tracking error equation may be written as 


ei = — ei + b0 T uj — 161 - — e\ + d (9.4.17) 

T-S 

where 9 = 9 - 9* ,9 = [0 O , c 0 ] T , 9* = [0o,Cq] t ,0o = ~ g ir, c o = \,u = [x,r\ T 


ei 

e2 



e\ + b9 T co 


ei 


| 6 | 


e2 + d 


(9.4.18) 


We develop the adaptive laws for 9q. cq by considering the following Lyapu- 
nov-like function 


V = 


(ei + e 2 ) 2 
2 




(9.4.19) 


where Iq > 0 is an arbitrary constant to be selected. 

The time-derivative V along the solution of (9.4.18) may be written as 


V 


- ^1 + ef - + (ei + e 2 )d + eie 2 

+(ei + e 2 )60 T w + |6|0 T Pr[-(ei + e 2 )wsgn(6)] 


2 | 6 | 


(9.4.20) 


by choosing the adaptive law 

9 = Pr [— (ei + e 2 )cusgn(6)] (9.4.21) 

f-(ei + e 2 )(usgn(6) if \9\ < M 0 or 

= J if |0| = Mo and 0 T (ei+e 2 )cusgn(6) > 0 

[- (/ - (ei + e 2 )wsgn (b) otherwise 
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where 1 0(0) | < Mq and M 0 > \0*\. Selecting 1$ = ^ + 1, completing the 
squares and using the properties of projection, we can establish that 


V < -^e 2 - ^e 2 + — 
- 2r 61 2t 62+ \b\ 


(9.4.22) 


which implies that 9,e i,e 2 G £oo and ei,e 2 G S(tcI 2 ), i.e. , the rn.s.s. bound 
for ei, e 2 can be made arbitrarily small by choosing an arbitrarily small r. In 
addition, x, lo G £oo- Furthermore if d = 0, we can establish that e\, e 2 — ► 0 
as t — * oo. 

Let us now obtain an Too-bound for e\. From (9.4.17) we have 


ei 


TS 

ts 2 + (r + \b\)s + |6| 


(w + d) 


(9.4.23) 


where w = b(r lo. Because we have shown above that w G Coo for any r > 0, 
we can treat w as a bounded disturbance term. As in the non-adaptive case 
we express e\ as 


ei 


|6| - r 




TS + |6| 


1 

s + 1 


(rc + d) 


(9.4.24) 


for \b\ ^ r, which implies that 


ei{t)\ < 


\b\ -t 



e i(0)| + 


2 r 

|6| - r 


(u; 0 + d 0 ) 


(9.4.25) 


where tco and do are the upper bounds for w and d respectively. It is therefore 
clear that the modified adaptive control scheme guarantees that the tracking 
error converges exponentially to the residual set 


S e 



ei | < 


2 r 

\b\ -t 


(w 0 + d 0 ) 


(9.4.26) 


whose size can be made arbitrarily small by choosing an arbitrarily small r. 
The residual set is qualitatively the same as in the non-adaptive case. The 
difference is in the value of iuq, the bound for the parametric uncertainty term 
bd T co. One can argue that tco should be smaller in the adaptive case than in 
the non-adaptive case due to the learning capability of the adaptive scheme. 
Another significant difference is that in the absence of the disturbance, i.e., 
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Figure 9.6 Simulation for (a) MRAC without modification; (b) modified 
MRAC with t = 0.5. 


d = 0, the modified adaptive scheme guarantees that e\ — > 0 as t — ► oo for 
any r > 0. 

The significance of the modified adaptive control scheme is that it guar- 
antees rn.s.s. and Coo bounds for the tracking error that can be made small 
by choosing a small design parameter r. This means that by the proper 
choice of r, we can significantly reduce bursting and improve the tracking 
error performance of the adaptive control scheme. 

We demonstrate the effectiveness of the modified scheme by simulating 
it with the same disturbance as in Example 9.4.1 The simulation results 
with r = 0.5 (a moderate value for the design parameter) are shown in 
Figures 9.6. It can be seen that the bursting is suppressed by the additional 
compensation term. 

The methodology used in the above example can be extended to the 
general case of MRAC with unnornralized adaptive laws. 





9.4. PERFORMANCE IMPROVEMENT OF MRAC 


703 


9.4.2 Modified MRAC with Normalized Adaptive Laws 

The method used in the previous section can be extended to MRAC with 
normalized adaptive laws. In this section we briefly describe two modifica- 
tions that can be used to improve the performance of MRAC with normalized 
adaptive laws [36, 39, 211, 212], 

We consider the general plant 

Up = k p ^ P [ S \ ( u p + d u ) (9.4.27) 

Up[s) 

where k p , Z p , and R p satisfy assumptions PI to P4 given in Section 6.3 and 
d u is a bounded input disturbance. For simplicity we assume that k p is 
known. The coefficients of Z p ,R p are completely unknown. The reference 
model is given by 

Um = W m {s)r = k m ^ m \ S \ r (9.4.28) 

Rm{s) 

where W m (s) satisfies assumptions Ml and M2 given in Section 6.3. 

We consider the control law 

Up = 9 q ( t)uJo + c* 0 r + u a (9.4.29) 

where 8 0 = [9j , 8j , 9 3 } T , uj 0 = [uj , loJ , y p } T , c* 0 = k m /k p ,uji = ^u p ,uj 2 = 

j^Up, a(s) = [s n-2 , •••,«, 1] T and A = Ao Z m is a Hurwitz polynomial of 
degree n — 1 . 

The auxiliary input u a is to be chosen for performance improvement. The 
parameter vector #o(t) may be generated by any one of the robust adaptive 
laws of Chapter 8. As an example let us use the gradient algorithm with 
projection based on the parametric plant model z = 0g T </>o — d v , developed 
using equation (9.3.62), where z = W m (s)u p — c^i/p, d v = A '' W m d„. i.e. , 

9 0 = Pr[re0 o ] 

_ z - 0([ 4>o 

^ 2 
m L 

m 2 = 1 + n 2 , n 2 = m s (9.4.30) 

m s = -5 0 m s + u 2 p + y 2 , m s (0) = 0 

4>o = W m (s)u 0 
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0ju>o 


Figure 9.7 A closed-loop representation of the modified MRAC scheme. 

where the projection operator Pr [-] constrains do to satisfy |#o(£)| < Mo Vi > 
0 for some Mo > | 9q | , T = T t > 0 and So > 0 is chosen so that W m (s) and 
the filters for generating loo are analytic in i?e[s] > — Sq/2. 

With (9.4.29) the tracking error equation may be written as 

ei = \w m (s) [<9([ w 0 + u a + di] (9.4.31) 

c o 

where d\ = a d u ,6o = 6q — 9q by following the same procedure as the 

one used to develop equation (9.3.65). The idea now is to choose u a so 
that all signals in the closed-loop plant remain bounded as in the case of 
u a = 0 and the effect of 0q loq + d\ on e\ is reduced as much as possible. 
We present two different methods that lead to two different choices of u a 
achieving similar results. 



Method 1 . This method is developed in [36, 212] and is described as 
follows: 

We choose u a as 

Ua = -C(s)e 1 (9.4.32) 

where C(s) is a proper transfer function to be designed. With (9.4.32) the 
tracking error system may be represented by the Figure 9.7. 

The set of all stabilizing compensators C(s) for this system is given by 


C(s) 


Q(s) 

1 - ^W m {s)Q{s) 
c 0 


(9.4.33) 


where Q(s) ranges over the set of all stable rational transfer functions [36]. 
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With this choice of C(s), the tracking error equation becomes 


ei 


1 - \w m (s)Q{s) 
c 0 


-^W m (s)[9Q u>o + d\] 

c o 


which implies that 


eit||oo < ||^m.||i(||(6'([wo)t||oo + ||c?it||oo) 


where h m is the impulse response of the system with transfer function 
^*W m (s)(l — ^W m (s)Q(s)) . The problem now reduces to choosing a proper 
stable rational function Q(s) to minimize the jC\ norm of h m , he., to make 
||/i m ||i as small as possible. 


Lemma 9.4.1 The transfer function 


Q(s) 


CpWnfHs) 

(ts + l ) n * 


(9.4.34) 


where n* is the relative degree of the plant and r > 0 is a design constant 
guarantees that 

1 1 h m 1 1 1 —>-0 as t — ► 0 


Proof We have 


1 


1 - -W m (s)Q(s) 


-'0 


W m (s) (ts+1)"* -1 W m (s) 


-o 


(ts + 1 y 


= TS 


w m (s ) 


-o 


1 


1 


1 


Hence, 


TS + 1 (TS + l) 2 

ll^mlll < T || 7?0 1| 1 [II Til 111 + II *-2 II 1 


(rs + l) r 

Mi] 


where ho(t) is the impulse response of sW m (s)/cQ and hft) the impulse response of 
1/(ts + 1)*, i = 1, 2, . . . , n* . It is easy to verify that ||/io||i, ||/ii|| l are bounded from 
above by a constant c > 0 independent of t. Therefore ||/i m ||i < tc and the proof 
is complete. □ 


Using (9.4.32), (9.4.33) and (9.4.34), the control law (9.4.29) becomes 

Up = 9 q (t)u>o + CqT — C(s)e\ (9.4.35) 

doW-'js) 

(ts+ l ) n * - 1 


C(s) 
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which together with the adaptive law (9.4.30) and Cq = k m /k p describe the 
modified MRAC scheme. 

The control law (9.4.35) poses no implementation problems because C(s) 
is proper, and ei is measured. 

Theorem 9.4.1 The MRAC scheme described by (9-4-30) and (9-4-35) with 
t e (0,1/ <5o) guarantees the following properties when applied to the plant 
(9-4-21): 

(i) All signals are bounded. 

(ii) The tracking error satisfies 

lim sup |ei(r 0 )| < r(c + do) 

t ^ 00 T 0 >t 

where do is an upper bound for the disturbance d u and c > 0 is a 
constant independent of t. 

(iii) When d u = 0, e\(t) — > 0 as t — * oo. 

The proof of Theorem 9.4.1 follows from the proofs of the standard robust 
MRAC schemes and is left as an exercise for the reader. 

Theorem 9.4.1 indicates that the steady state value of e± can be made 
arbitrarily small by choosing a small design parameter r. Small r implies the 
presence of a high gain equal to 1/r in the control law (9.4.35). Such a high 
gain is expected to have adverse effects on robust stability demonstrating the 
classical trade-off between robust stability and nominal performance that is 
present in every robust control scheme in the nonadaptive case. In the 
presence of unmodeled dynamics it is expected that r has to meet a lower 
bound for preservation of robust stability. 

Method 2. This method is developed and analyzed in [39, 211] and is 
described as follows: 

We consider the tracking error equation (9.4.31) 

ei = —W m (s)(0ou o + u a + di) 
c o 
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Using Swapping Lemma A.l we have 

W m (s)0ju, o = + W c (s)(W b (s)u$)'e o 

where the elements of W c , Wb are strictly proper transfer functions with the 
same poles as W m (s) . From the expression of the normalized estimation 
error in (9.4.30), we have 


em 2 = 

where d v = W m (s)d\. Therefore, em 2 = — W m (s)d\ leading to the 

tracking error equation 



—em 2 + W c (WbUiQ )0 q + W m u a 


Because e, W c , 9 q are known, the input u a is to be chosen to reduce 

the effect of 9q, 9q on e\. 

Let us choose 

u a = —Q(s)[—em 2 + U / c (s)(W b (s)a; ( [)6'o] = -Q(s)W m (s)(6l u 0 + di) 

(9.4.36) 

where Q(s) is a proper stable transfer function to be designed. With this 
choice of u a , we have 

e i = -irK 1 “ W m {s)Q{s))W m {s){eQUjQ + di)] 
c o 

which implies that 


l e ltl|oo < ||^m||i(||(^o u o)t 


\\du ||c 


where h m (t ) is the impulse response of (1 — W m (s)Q(s))W m (s). 
As in Method 1 if we choose Q(s ) as 


Q(s) = 


W-\s) 

( TS + l) n ’ 


(9.4.37) 


we can establish that ||/i m ||i 
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With (9.4.37) the modified control law (9.4.29), (9.4.36) becomes 
W~ 1 (s) 

Up = OqWq + c* 0 r - + W c (s)(W b (s)^ )9 0 ) (9.4.38) 

(TS + l) n 

where W c , W b can be calculated from the Swapping Lemma A.l and 6 q = 9q 
is available from the adaptive law. 

Theorem 9.4.2 The modified MRAC scheme described by (9-4.30), (9-4-38) 
with t E (O,l/<5 0 ) guarantees the following properties when applied to the 
plant (9-4-27): 

(i) All signals are bounded. 

(ii) The tracking error e\ satisfies 

lim sup |ei(r 0 )| < r(c + d 0 ) 

t -* 00 T 0 >t 

where do is an upper bound for the disturbance d u and c> 0 is a finite 
constant independent of t. 

(iii) When d u = 0 we have |ei(t)| — > 0 as t — > oo. 


The proof of Theorem 9.4.2 follows from the proofs of the robust MRAC 
schemes presented in Section 9.3 and is given in [39]. 

In [39], the robustness properties of the modified MRAC scheme are 
analyzed by applying it to the plant 


y P 


Z{s) 

’R(s) 


(1 + /iA 

m (s))u P 


where /jA m (s) is a multiplicative perturbation and // > 0. It is established 
that for r € (r mm , ^) where 0 < r mm < there exists a fi*{r min ) > 0 such 
that all signals are bounded and 


lim sup |ei(ro)| < rc + p,c 

t— >0O TQ >f 

where c > 0 is a constant independent of r, p,. The function fj,*(r m i n ) is 
such that as r mm — > 0,/r*(r m j n ) 0 demonstrating that for a given size 

of unmodeled dynamics characterized by the value of //* , r cannot be made 
arbitrarily small. 
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Remark 9.4.1 The modified MRAC schemes proposed above are based on 
the assumption that the high frequency gain k p is known. The case of 
unknown k p is not as straightforward. It is analyzed in [37] under the 
assumption that a lower and an upper bound for k p is known a priori. 

Remark 9.4.2 The performance of MRAC that includes transient as well 
as steady-state behavior is a challenging problem especially in the 
presence of modeling errors. The effect of the various design param- 
eters, such as adaptive gains and filters, on the performance and ro- 
bustness of MRAC is not easy to quantify and is unknown in gen- 
eral. Tuning of some of the design parameters for improved perfor- 
mance is found to be essential even in computer simulations, let alone 
real-time implementations, especially for high order plants. For addi- 
tional results on the performance of MRAC, the reader is referred to 
[37, 119, 148, 182, 211, 227, 241], 

9.5 Robust APPC Schemes 

In Chapter 7, we designed and analyzed a wide class of APPC schemes for 
a plant that is assumed to be finite dimensional, LTI, free of any noise and 
external disturbances and whose transfer function satisfies assumptions PI 
to P3. 

In this section, we consider APPC schemes that are designed for a sim- 
plified plant model but are applied to a higher-order plant with unmodeled 
dynamics and bounded disturbances. In particular, we consider the higher 
order plant model which we refer to it as the actual plant 

Up = G 0 (s)[l + A m(s)](u p + d u ) (9.5.1) 

where G'o(-s) satisfies PI to P3 given in Chapter 7, A m (s) is an unknown 
multiplicative uncertainty, d u is a bounded input disturbance and the overall 
plant transfer function G(s) = G ? o(s)(l+A m (s)) is strictly proper. We design 
APPC schemes for the lower-order plant model 

Up = G 0 (s)u p (9.5.2) 

but apply and analyze them for the higher order plant model (9.5.1). The ef- 
fect of perturbation A m and disturbance d u on the stability and performance 
of the APPC schemes is investigated in the following sections. 



710 


CHAPTER 9. ROBUST ADAPTIVE CONTROL SCHEMES 


Plant 



Figure 9.8 Closed-loop PPC schemes with unmodeled dynamics and 
bounded disturbances. 


We first consider the nonadaptive case where Go(s) is known exactly. 


9.5.1 PPC: Known Parameters 

Let us consider the control laws of Section 7.3.3 that are designed for the 
simplified plant model (9.5.2) with known plant parameters and apply them 
to the higher order plant (9.5.1). The block diagram of the closed-loop plant 
is shown in Figure 9.8 where C(s) is the transfer function of the controller. 
The expression for C(s) for each of the PPC laws developed in Chapter 7 is 
given as follows. 

For the control law in (7.3.6) and (7.3.11) of Section 7.3.2 which is based 
on the polynomial approach, i.e., 


Q m Lu p = P(y m -y p ), LQ m R p + P Z p = A* (9.5.3) 


where Q m {s) is the internal model of the reference signal y rn . i.e., Q m (s)y m = 
0, we have 


C(s) 


P(s) 

Qm.(s)L(s) 


(9.5.4) 


The control law (7.3.19), (7.3.20) of Section 7.3.3 based on the state- variable 
approach, i.e., 


e 

Up 


Ae + Bu p — K 0 (C T e — e\) 

-T<' " — Rl. - 

Up — Up 

^cm 


(9.5.5) 
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where K c satisfies (7.3.21) and K a satisfies (7.3.22), can be put in the form 
of Figure 9.8 as follows: 

We have 

e(s) = (sI-A + K 0 C t )~\Bu p + K a e i) 

and 

u p = - K c {sl - A + K 0 C t )-\Bu p + K 0 e\) 

i.e., 

_ _ K c (sI-A + K 0 C t )~ 1 K 0 
Up 1 + K c (sl - A + K a C T )- 1 B 61 

and, therefore, 


r() _ K c (sl — A + K 0 C t )~ 1 K 0 Qi(s) 
[S) (1 + K c (sl -71 + K 0 C T )~'B) Q m (s ) 


(9.5.6) 


For the LQ control of Section 7.3.4, the same control law (9.5.5) is used, 
but K c is calculated by solving the algebraic equation 


A T P + PA-PBX~ 1 B T P + CC T = 0, K c = X~ 1 B t P (9.5.7) 


Therefore, the expression for the transfer function C(s) is exactly the same 
as (9.5.6) except that (9.5.7) should be used to calculate K c . 

We express the closed-loop PPC plant into the general feedback form 
discussed in Section 3.6 to obtain 


u p 
Up 

where G = Go(l + A m ) is the overall transfer function and C is different 
for different pole placement schemes. The stability properties of (9.5.8) are 
given by the following theorem: 

Theorem 9.5.1 The closed-loop plant described by (9.5.8) is internally sta- 
ble provided 

||T 0 (s)A m (s)|| oo < 1 

where Tq(s) = i+cq is the complementary sensitivity function of the nomi- 
nal plant. Furthermore, the tracking error e\ converges exponentially to the 
residual set 


C -CG 

1 + CG IT CG 
CG G 

1 + CG 1 + CG 


Vm 

d u 


(9.5.8) 


V e = {ei ||ei| < cd 0 ) 


(9.5.9) 



712 


CHAPTER 9. ROBUST ADAPTIVE CONTROL SCHEMES 


where do is an upper bound for \d u \ and c > 0 is a constant. 


Proof The proof of the first part of Theorem 9.5.1 follows immediately from equa- 
tion (9.5.8) and the small gain theorem by expressing the characteristic equation of 
(9.5.8) as 


1 + 


CG o 
1 + CG 0 


A,- 


= 0 


To establish (9.5.9), we use (9.5.8) to write 


ei 


CG J , G J 1 , G 

1 + CG ~ Vm+ 1 + CG du ~ 1 + CG Vm + 1 + CG 


It follows from (9.5.4), (9.5.6) that the controller C(s) is of the form C(s) = Q°fy 
for some C'o(s). Therefore 


Qm . GQ m 

ei ~~ " Q m + C 0 G ym + Q m + C 0 G u 

where G = Gq(1 + A m ). Since Q m y m = 0 and the closed-loop plant is internally 
stable due to ||T 0 (s)A m (s)|| oo < 1, we have 


ei 


(1 + N m )GoQn 


Qm + CqG o + A m C 0 G 0 


£t 


(9.5.10) 


where e t is an exponentially decaying to zero term. Therefore, (9.5.9) is implied by 
(9.5.10) and the internal stability of the closed-loop plant. □ 


It should be pointed out that the tracking error at steady state is not 
affected by y rn despite the presence of the unmodeled dynamics. That is, if 
d u = 0 and A m / 0, we still have ei(i) — > 0 as t — > oo provided the closed- 
loop plant is internally stable. This is due to the incorporation of the internal 
model of y m in the control law. If Q m (s ) contains the internal model of d u 
as a factor, i.e., Q m {s) = Qd(s)Q m (s ) where Qd{s)d u = 0 and Q m {s)y m = 0, 
then it follows from (9.5.10) that ei = et, i.e., the tracking error converges 
to zero exponentially despite the presence of the input disturbance. The 
internal model of d u can be constructed if we know the frequencies of d u . 
For example, if d u is a slowly varying signal of unknown magnitude we could 
choose Qd(s) = s. 

The robustness and performance properties of the PPC schemes given by 
Theorem 9.5.1 are based on the assumption that the parameters of the mod- 
eled part of the plant, i.e., the coefficients of G'o(-s) are known exactly. When 
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the coefficients of G'o(s) are unknown, the PPC laws (9.5.3) and (9.5.5) are 
combined with adaptive laws that provide on-line estimates for the unknown 
parameters leading to a wide class of APPC schemes. The design of these 
APPC schemes so that their robustness and performance properties are as 
close as possible to those described by Theorem 9.5.1 for the known param- 
eter case is a challenging problem in robust adaptive control and is treated 
in the following sections. 


9.5.2 Robust Adaptive Laws for APPC Schemes 

The adaptive control schemes of Chapter 7 can meet the control objective 
for the ideal plant (9.5.2) but not for the actual plant (9.5.1) where A m (s) / 
0 ,d u / 0. The presence of A m and/or d u may easily lead to various types 
of instability. As in the case of MRAC, instabilities can be counteracted 
and robustness properties improved if instead of the adaptive laws used in 
Chapter 7, we use robust adaptive laws to update or estimate the unknown 
parameters. 

The robust adaptive laws to be combined with PPC laws developed for 
the known parameter case are generated by first expressing the unknown 
parameters of the modeled part of the plant in the form of the parametric 
models considered in Chapter 8 and then applying the results of Chapter 8 
directly. 

We start by writing the plant equation (9.5.1) as 


R p y p = Z p (l + A m )(u p + d u ) (9.5.11) 

where Z p = 0^ T a n _i(s), R p = s n + 0* T a n _i(s); 0^,9* are the coefficient 


= [s n_1 , s n ~ 2 , . . . , s, 1] T . Filtering 
each side of (9.5.11) with A , where A p (s) = s n + Ais n_1 + • • • + A n is 
Hurwitz, we obtain 

(9.5.12) 


vectors of Z p , R p respectively and a n _i(s) = [s'"' ", s 

lere 

z = 9* T 4> + y 


where 


z = 


A P (s) 


Up i 


r b T ,e* a T ] T 




<*n-l(g) 

A„ 


T / \ 1 T 

a n _i(s) 

U Pl A VP 

lYrt 


, 7J = — ^[A m u p + (1 + A m )d u \ 

l\p 



714 


CHAPTER 9. ROBUST ADAPTIVE CONTROL SCHEMES 


Equation (9.5.12) is in the form of the linear parametric model considered 
in Chapter 8, and therefore it can be used to generate a wide class of robust 
adaptive laws of the gradient and least-squares type. As we have shown 
in Chapter 8, one of the main ingredients of a robust adaptive law is the 
normalizing signal m that needs to be chosen so that ^ ^ G £oo . We apply 
Lemma 3.3.2 and write 


\v{t) | < 


Z p (s) 


A P 0) 


A m ,(-s) 


1 1 1 2<5 ~ b 


2<5 


Zp(s)(l + A m (s)) 


A P (s) 


-4+U (9.5.13) 

26 


for some d > 0, where et is an exponentially decaying to zero term and 
do = sup t \d u (t)\. Similarly, 


1^)1 < Y , 

2=1 



\\(y P )ths) 


(9.5.14) 


The above H 25 norms exist provided 1/A p (s) and A m (s) are analytic in 
Re[s] > —d /2 and ^ m is strictly proper. Because the overall plant transfer 
function G(s) and Go(s) are assumed to be strictly proper, it follows that 
G oA m and therefore A — are strictly proper. Let us now assume that 
A m (s) is analytic in Re[s] > —5q/2 for some known do > 0. If we design 
Ap(s) to have roots in the region Re[s] < — do/2 then it follows from (9.5.13), 
(9.5.14) by setting d = do that the normalizing signal m given by 


m 2 — 1 + ll^piH^o + WVptWho 


bounds 77, (j) from above. The signal m may be generated from the equations 

2 1 1 2 2 

m = 1 + n s , n s = m s 

m s = -d 0 m s + u 2 + y 2 , m s ( 0) = 0 (9.5.15) 

Using (9.5.15) and the parametric model (9.5.12), a wide class of robust 
adaptive laws may be generated by employing the results of Chapter 8 or by 
simply using Tables 8.2 to 8.4. 

As an example let us consider a robust adaptive law based on the gradient 
algorithm and switching (7-modification to generate on-line estimates of 9* 
in (9.5.12). We have from Table 8.2 that 


p 


Te<f> — a s T 6 p 
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z — 9j(j> 2 2 2 

e = 7 . — , m =1 + n,, n„=m s 

rn z 

s n 

z = vT ) 2 " 1 (9 ' 5J6) 

= -S 0 rn s + Up + y p , m s ( 0) = 0 
[ 0 if |0 P | < M 0 

^ = S *o(ffi-l) if M 0 < |0 P | < 2M 0 
[ (T 0 if |0 P | > M 0 

where 0 P is the estimate of 9*, Mq > \9*\, <7o > 0 and T = T t > 0. As estab- 
lished in Chapter 8, the above adaptive law guarantees that (i) e, en s , 9 p , 9 p e 
£oo, (ii) e,e7i s , 6 P £ S(7] 2 /m 2 ) independent of the boundedness of 4>,z,m. 

The adaptive law (9.5.16) or any other robust adaptive law based on para- 
metric model (9.5.12) can be combined with the PPC laws (9.5.3), (9.5.5) 
and (9.5.7) to generate a wide class of robust APPC schemes as demonstrated 
in the following sections. 

9.5.3 Robust APPC: Polynomial Approach 

Let us combine the PPC law (9.5.3) with a robust adaptive law by replac- 
ing the unknown polynomials R p , Z p of the modeled part of the plant with 
their on-line estimates R p (s,t ), Z p (s,t) generated by the adaptive law. The 
resulting robust APPC scheme is summarized in Table 9.5. 

We like to examine the properties of the APPC schemes designed for 
(9.5.2) but applied to the actual plant (9.5.1) with A m (s) / 0 and d u / 0. 

As in the MRAC case, we first start with a simple example and then 
generalize it to the plant (9.5.1). 

Example 9.5.1 Let us consider the plant 

y P = — b — (1 + A m{s))u p (9.5.17) 

s + a 

where a,b are unknown constants and A m (s) is a multiplicative plant uncertainty. 
The input u p has to be chosen so that the poles of the closed-loop modeled part 
of the plant (i.e., with A m (s) = 0) are placed at the roots of A*(s) = (s + l) 2 
and y p tracks the constant reference signal y rn = 1 as close as possible. The same 
control problem has been considered and solved in Example 7.4.1 for the case where 
A m (s) = 0. 
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Table 9.5 Robust APPC scheme: polynomial approach 


Actual plant 

Up ~ + d u ) 

Plant model 

Vp = r}(J) u p > z p( s ) = 0* b T at{s),R p {s) = s n + 9*J a{s) 
e; = [6* b T ,e* a T } T ,a(s) = a n - 1 (s) 

Reference 

signal 

Qm{s)y m = 0 

Assumptions 

(i) Modeled part of the plant y p = -jf-Up and Q m (s) 
satisfy assumptions PI to P3 given in Section 7.3.1; 

(ii) A m (s) is analytic in Refs] > — 5q/2 for some 
known do > 0 

Robust 
adaptive law 

Use any robust adaptive law from Tables 8.2 to 8.4 
based on the parametric model z = 9* T </> + rj with 

2 = 4(s) vp^ = [ a J sj u p > a JJ ypV 

V = ^[A m (u p + d u ) + d u \ 

Calculation of 

controller 

parameters 

Solve for L(s,t) = s n ~ 1 +l T (t)a n - 2 (s) 
P(s,t)=P T {t)a n+q -i(s) from the equation 

L(s,t) ■ Qm(s ) • Rp(s,t ) + p(s,t) ■ Z p (s,t ) = A*(s), 
where Z p (s,t ) = dj(t)a(s), R p (s,t) = s n + 9j (t)a(s) 

Control law 

Up = (A — LQm)-^Up P-jy('jJp J/m) 

Design 

variables 

A(s) = Ap(s)A g (s), A p ,A q are rnonic and Hurwitz 
of degree n and q — 1 respectively and with roots 
in Refs] < — So/2; A*(s) is rnonic Hurwitz of degree 
2n+q— 1 with roots in Refs] < — 5q/2) Q m (s ) is rnonic 
of degree q with nonrepeated roots on the jA-axis 
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We design each block of the robust APPC for the above plant as follows: 
Robust Adaptive Law The parametric model for the plant is 


e; T 


< i> 


+ 


v 


where 


2 = 


s T A 


s T A 
u p , -y p \ T 


y p , 9* = [b,a\ 


V = 


s T A 


A m (s) u p 


We assume that A m (s) is analytic in Re[s] > —5q/2 for some known 5o > 0 and 
design A > So/2. Using the results of Chapter 8, we develop the following robust 
adaptive law: 


9 p = 

e = 


Pr{T(e0- 

-cr s 6» p )}, 

T : 

II 

Tl 

H 

V 

o 

z ~ Oj<t> 

9 ? 

m 2 = 1 4 

n 2 s , 

, n 2 = m s 

m z 




- S 0 m s + 

\u P \ 2 + \y P \ 

2 

1 

m. s ( 0) = 0 

f o 

if 

K\ 

< M 0 

J \ M 

- l) cr 0 if 

M 0 

< \0 P \ < 2 Mo 

[ O’ 0 

if 

\o P \ 

> 2 M 0 


(9.5.18) 


where 9 p = [6, a], Pr{-} is the projection operator which keeps the estimate |6(t)| > 
b 0 > 0 Vf > 0 as defined in Example 7.4.1 where b 0 is a known lower bound for |6|; 
and Mo > \6*\,ao > 0 are design constants. 

Control Law The control law is given by 


u p — . , 

s + A 


u p - (pis +p 0 )— l -^(y P ~ Vm) 

S A 


(9.5.19) 


where p ± , po satisfy 


s ■ (s + a) + (pis + po) ■ b = (s + l) 2 


leading to 



As shown in Chapter 8, the robust adaptive law (9.5.18) guarantees that 


(i) e, em, 9 G C <*,, (ii) e, em, 9 G Sir/ 2 /m 2 ) 
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Because of the projection that guarantees \b(t)\ > bo > 0 Vf > 0, we also have that 
Poj.Pi £ Coo and p 0 ,p i £ S(y 2 /m 2 ). Because 

= A 2 (9.5.20) 

28 q 


H 


< 


s A 


A m (s) 


we have p 0 ,p 1 ,e, em, 9 £ <S(A 2 ). 

We use the properties of the adaptive law to analyze the stability properties of 
the robust APPC scheme (9.5.18) and (9.5.19) when applied to the plant (9.5.17) 
with A m (s) 0. The steps involved in the analysis are the same as in the ideal 
case presented in Chapter 7 and are elaborated below: 


Stepl. Express u p ,y p in terms of the estimation error. As in Example 7.4.1, 
we define the states 


0i 


sTa“ p ’ 


02 


c\ Vp ' 


0m 


1 

i 7 
s + A 


From the adaptive law, we have 


em 2 = 


s qT 1 


s A 


Up 


s A 


[ u p j Vp\ 


-02 - 9 p [0!, 0 2 ] T 


i.e., 


(j >2 = ~b(f> i — a0 2 — em 2 (9.5.21) 

From the control law, we have 

Up = X(f>i + Pl0 2 + 0002 + Po0m + Pi 0m 
Because u p = <fi± + A0i, it follows from above and the equation (9.5.21) that 

0i = -Pib0i - (pi« - Po) 02 - Piem 2 + y m (9.5.22) 


where y m = pi0 m +Po0m £ Coo due to po,Pi £ Coo ■ Combining (9.5.21), (9.5.22), 
we obtain exactly the same equation (7.4.15) as in the ideal case, i.e., 


x = A{t)x + bi(t)em 2 + 6 2 </ m 
u p = x\ + \x\, y p = —X 2 ~ \x 2 

where x = [aq,a; 2 ] T = [<^>i , <0> 2 ] T , 


m 


-pib -{pia-p 0 ) 
—b —a 


bi(t) 


-Pi 
-1 ’ 


b 2 


(9.5.23) 


1 

0 


Note that in deriving (9.5.23), we only used equation em 2 = z — 9^ (f> and the 
control law (9.5.19). In both equations, y or A , n (s) do not appear explicitly. 
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Equation (9.5.23) is exactly the same as (7.4.15) and for each fixed t, we have 
detfsl — A(t)) = (s + l) 2 . However, in contrast to the ideal case where em, || A(t) || £ 
T-2 f) £oo> here we can only establish that em, ||A(f)|| £ S( A 2 ) f) Coo, which follows 
from the fact that em.,9 £ ^(Aj) f)^oo 5 ^ G Coo and \b(t)\ > b 0 guaranteed by the 
adaptive law. 

Step 2. Show that the homogeneous part of (9.5.23) is e.s. For each fixed 
t, det(sl — A(f)) = (s + a)s + b(p\s + po) = (s + l) 2 , i.e., A(A(t)) = — 1 ,Vt > 

0. From a,b,p lt p 0 £ S(A^)f)C 00 , we have \\A(t)\\ £ <S(A|)f|^oo. Applying 
Theorem 3.4.11(b), it follows that for A 2 < A* for some A* > 0, the matrix Aft) 
is u.a.s. which implies that the transition matrix 4>(t,r) of the homogeneous part 
of (9.5.23) satisfies 

||$(t,r)|| < k 1 e~ k2(t ^ T \ V< > r > 0 
for some constants k \ , k-2 > 0. 

Step 3. Use the B-G lemma to establish signal boundedness. Proceeding the 
same way as in the ideal case and applying Lemma 3.3.3 to (9.5.23), we obtain 

kWI.IMI < c||em 2 || +c 

where ||(-)|| denotes the C25 norm IKOtlk for any 0 < 6 < min[^o, 2 ^ 2 ] and c > 0 
denotes any finite constant. From (9.5.23), we also have 

IKII, \\y P \\ < c\\em 2 \\+c 

Therefore, the fictitious normalizing signal m 2 = 1 + ||u p || 2 + \\y p \\ 2 satisfies 

to 2 < c||e?n 2 || 2 + c 

The signal m/ bounds m,x,u p ,y p from above. This property of to/ is established 
by using Lemma 3.3.2 to first show that fa/rrif and therefore x/mf £ Coo- 

From <5 < 6 0 we also have that to/to/ £ Coo- Because the elements of A(t) are 
bounded (guaranteed by the adaptive law and the coprimeness of the estimated 
polynomials) and em £ Coo, it follows from (9.5.23) that x/mf £ Coo and therefore 
u p /mf, yp/mf £ Coo- The signals 4> x, u p , y p can also be shown to be bounded 
from above by to due to A > Sq/2. We can therefore write 

to 2 < c||emm/|| 2 + c = c + c f e~ s< ' t ~ T ^g 2 (T)m‘j (r)dr 

Jo 

where g = em £ ^(A^). Applying B-G Lemma III, we obtain 

? lT)dT +c6 f e-^-^e 0 f s' 9 * {T)dT ds 
Jo 


m 2 f < ce St e' 


r 

Jo 
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Because c f* g 2 (r)dr < cA|(f — s) + c, it follows that for cA 2 < S, we have to/ € Coo. 
From to/ € £„o, we have x,u p ,y p ,m £ Coo and therefore all signals are bounded. 
The condition on A 2 for signal boundedness is summarized as follows: 

cA 2 < mm[VS, cA*], 0 < <5 < min[<$o, 2/c 2 ] 


where as indicated before A* > 0 is the bound for A 2 for A(t) to be u.a.s. 


Step 4. Establish tracking error bounds. As in the ideal case considered in 
Example 7.4.1 (Step 4), we can establish by following exactly the same procedure 
that 


s(s + A) 2 s + A 1 fl \ 
ei = , , - , . ,vd «— -\Vp ~ 0— t u p) 


(s + 1) 2 


(s+l) 2V s + A* p s + A 


(9.5.24) 


This equation is exactly the same as in the ideal case except that em, a,b £ 5(A|) 
instead of being in £ 2 . Because y p ,u p ,m £ Coo, it follows that 

em 2 , (a— l —y p - b-^-Up) £ 5(A|) 

S t A S ~r A 

Therefore by writing = 1 + ? using em 2 € <S(A 2 ) and applying 

Corollary 3.3.3 to (9.5.24), we obtain 


e\dr < cA\t + c (9.5.25) 

which implies that the mean value of e 2 is of the order of the modeling error char- 
acterized by A 2 . 

Let us now simulate the APPC scheme (9.5.18), (9.5.19) applied to the plant 
given by (9.5.17). For simulation purposes, we use a=— 1,6=1 and A m (s)= 

The plant output response y p versus t is shown in Figure 9.9 for different values 
of /I that indicate the size of A m (s). As /i increases from 0, the response of y p 
deteriorates and for g = 0.28, the closed loop becomes unstable. V 



Remark 9.5.1 The calculation of the maximum size of unmodeled dynam- 
ics characterized by A 2 for robust stability is tedious and involves sev- 
eral conservative steps. The most complicated step is the calculation 
of A* using the proof of Theorem 3.4.11 and the rate of decay of the 
state transition matrix of A(t). In addition, these calculations involve 
the knowledge or bounds of the unknown parameters. The robustness 
results obtained are therefore more qualitative than quantitative. 



n=0.0 

ja=0.05 

(. 1 = 0. 1 

(- 1 = 0.2 


Figure 9.9 Plant output response for the APPC scheme of Example 9.5.1 
for different values of /i. 

Remark 9.5.2 In the above example the use of projection guarantees that 
\b(t)\ > bo > 0 Vf > 0 where bo is a known lower bound for |6| and 
therefore stabilizability of the estimated plant is assured. As we showed 
in Chapter 7, the problem of stabilizability becomes more difficult to 
handle in the higher order case since no procedure is yet available 
for the development of convex parameter sets where stabilizability is 
guaranteed. 

Let us now extend the results of Example 9.5.1 to higher order plants. 
We consider the APPC scheme of Table 9.5 that is designed based on the 
plant model (9.5.2) and applied to the actual plant (9.5.1). 

Theorem 9.5.2 Assume that the estimated polynomials R p (s , t ) , Z p (s , t) of 
the plant model are such that R p Q m , Z p are strongly coprime at each time t. 
There exists a 5 * > 0 such that if 

c(/ 0 + A 2 ) < 5*, where A 2 = A m (s) 

2Sq 
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then the APPC schemes of Table 9.5 guarantee that all signals are bounded 
and the tracking error e\ satisfies 

[ e\dr < c( + (Iq + fifit + c, Vf > 0 

Jo 

where fo = 0 in the case of switching-a and projection and fo > U in the 
case of fixed-a(fo = a), dead zone (fo = go) and e-modification (fo = ^o) 
and do is an upper bound for \d u \. 

The proof of Theorem 9.5.2 for d u = 0 follows directly from the analysis 
of Example 9.5.1 and the proof for the ideal case in Chapter 7. When d u 0 
the proof involves a contradiction argument similar to that in the MRAC 
case. The details of the proof are presented in Section 9.9.1. 


Remark 9.5.3 As discussed in Chapter 7, the assumption that the esti- 
mated time varying polynomials R p Q m ,Z p are strongly coprime can- 
not be guaranteed by the adaptive law. The modifications discussed 
in Chapter 7 could be used to relax this assumption without changing 
the qualitative nature of the results of Theorem 9.5.2. 

9.5.4 Robust APPC: State Feedback Law 

A robust APPC scheme based on a state feedback law can be formed by 
combining the PPC law (9.5.5) with a robust adaptive law as shown in 
Table 9.6. The design of the APPC scheme is based on the plant model 
(9.5.2) but is applied to the actual plant (9.5.1). We first demonstrate the 
design and analysis of the robust APPC scheme using the following example. 

Example 9.5.2 Let us consider the same plant as in Example 9.5.1, 

y P = s ^T a ( 1 + A m (s))u p (9.5.26) 

which for control design purposes is modeled as 

b 


where a,b are unknown and A m (s) is a multiplicative plant uncertainty that is 
analytic in Re[s] > —do/2 for some known do > 0. The control objective is to 
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Table 9.6 Robust APPC scheme: state feedback law 


Actual plant 

Up — ^(1 + Am) (v,p + d u ) 

Plant model 

$ 

II 

Reference 

signal 

Qm{s)y m = 0 

Assumptions 

Same as in Table 9.5 

Robust 
adaptive law 

Same as in Table 9.5; it generates the estimates 
Z P (s, 1) 

State observer 

e = Ae + BUp — K 0 (t)(C T e — ei), e G 7£” +9 

Ri+q— 1 

A= -0r(f) ,B = 02(t), C T = [1,0, . . . ,0] 

0 

RpQm = S n+9 + # 1 ~ (t)a n+g _l(s) 

EpQ 1 = $2 (^)c^n+(jr— l(s) 

A'o = a* - 8i,A*(s) = s n+q + a* T a n +q-i{s) 

^1 = Up 2/m 

Calculation of 

controller 

parameters 

Solve K c {t) from 

det (sI-A + BK c ) = A*(s) 

at each time t 

Control law 

Up — K c (t)e, Up = Up 

Design 

variables 

Qi(s), A*(s), A*(s) rnonic of degree q, n + q, n + q, 
respectively, with roots in Refs] < — e>o/2; A*(s) 
has Qi(s) as a factor; Q m (s) as in Table 9.5. 
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stabilize the actual plant and force y p to track the reference signal y m = 1 as close 
as possible. As in Example 9.5.1, the parametric model for the actual plant is given 
by 

2 = + n (9.5.27) 

where z = ^y P , $ = [u p , - y p ] T ,0 * = [6, a] T , rj = ^ A m u and A > 0 is chosen 
to satisfy A > S 0 /2. A robust adaptive law based on (9.5.27) is 

d p = Pr[r(e</> - cr s 0p )] (9.5.28) 

where 9 P = [b, a] T ;Pr(-) is the projection operator that guarantees that \b(t)\ > 
6o > 0 Vt > 0 where bo is a known lower bound for \b\. The other signals used in 
(9.5.28) are defined as 


Z & 2 , 2 2 

e = o — > m = 1 + rz„ , n„ = m s 

m z 

m s = -(5 0 m s + u p + y p , m s ( 0) = 0 (9.5.29) 


and a, is the switching tr- modification. Because y m = 1 we have Q m (s) = s and 
select Qi(s) = s + 1, i.e., we assume that <5o < 2. Choosing K a = [10,25] T — [a, 0] T , 
the state-observer is given by 



—a 1 


' 1 ' 


—a + 10 

e = 

0 0 

e + 

1 

bu p — 

25 


([lj0]e - ei) 


(9.5.30) 


where the poles of the observer, i.e., the eigenvalues of A — KqC, are chosen as 
the roots of A„(s) = s 2 + 10s + 25 = (s + 5) 2 . The closed-loop poles chosen as 
the roots of A*(s) = (s + l) 2 are used to calculate the controller parameter gain 
K c (t) = [k u k 2 \ using 

det (si - A + BK C ) = (s + l ) 2 


which gives 


The control law is given by 



= — [/ci,fc 2 ]e, 


u v = 


s + 1 . 


(9.5.31) 


The closed-loop plant is described by equations (9.5.26) to (9.5.31) and analyzed 
by using the following steps. 

Step 1 . Develop the state error equations for the closed-loop plant. We start 
with the plant equation 


(s + a)y p = 6(1 + A m (s))u p 
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which we rewrite as 


(s + a)sy p = 6(1 + A m (s))su p 

Using sy p = sei and filtering each side with = yyy, we obtain 


5 S 

o + a) — — e 1 = 6(1 + A m )u p , = — — u p 

S + 1 5+1 


which implies that 


b(s + 1) . . 

ei = , , — — (1 + A m )u p 


s(s + a) 

We consider the following state representation of (9.5.32) 


(9.5.32) 


e = 


— a 1 
0 0 


ei = [1, 0]e + r], y = 


s + A 


bUry 


A m (s)t 


A — a -f 1 
A 


(9.5.33) 


Let e 0 = e — e be the observation error. It follows from (9.5.30), (9.5.33) that e 0 
satisfies 


(9.5.34) 



' -10 

1 ' 


' 1 ' 


■ i ■ 


A — 9 

e Q = 

-25 

0 

e 0 + 

0 

ae i — 

i 

bu p + 

A - 25 


The plant output is related to e, e G , ei as follows: 

y v = C T e 0 + C r e + r] + y m , e 1 =y p -y 71 


(9.5.35) 


where C' T = [1,0]. A relationship between u p and e,e 0 ,rj that involves stable 
transfer functions is developed by using the identity 


s(s + a) | (2 — a)s + l 
(s + l) 2 + (s + 1) 2 


= 1 


(9.5.36) 


developed in Section 7.4.3 (see (7.4.37)) under the assumption that b,s(s + a) are 
coprime, i.e., b ^ 0. From (9.5.36), we have 

s(s + a) (2 — a)s + 1 


(s + 1) 


2 “P 


(s + iy 


Using su p = (s+l)u p = — (s+l)K c e and u p = ^^y p — A m u p in the above equation, 
we obtain 

<.„= - S -M\h.h]i+ l(2 ~?! 8 + 1 1 ! ( 2 ’ + '‘ ) !/ r - Hi* + A> ’> ( 9 . 5 . 37 ) 


s+1 1 


b(s + l) 2 


6(s + 1) 2 
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Substituting for u p = —K c (t)e in (9.5.30) and using C T e — e\ = —C T e 0 — r], 
C T = [1,0], we obtain 

'e = A c (t)e + ~ °25 10 (° Te o + 9) ( 9.5.38 ) 


where 

Mt)= l ~b \ [kM= _ ( ^ a) _° x (9-5.39) 

Equations (9.5.34) to (9.5.39) are the error equations to be analyzed in the steps to 
follow. 

Step 2. Establish the e.s. of the homogeneous parts of (9.5.34) an d (9.5.38). 
The homogeneous part of (9.5.34), i.e. , 


Y 0 = AY 0 , A = 


-10 1 ' 
-25 0 


is e.s. because det (si — A) = (s + 5) 2 . The homogeneous part of (9.5.38) is 
Y = A c (t)Y where det(sJ — A c (t)) = (s + 1) 2 at each time t. Hence, A (A c (t)) = — 1 

Vt > 0. The adaptive law guarantees that \b(t) \ > b 0 > 0 V< > 0 and a, b, a, b. k, k € 
Coo and a, b, ki € S(rj 2 /m 2 ). Because ^ < b A = A 2 , it follows that 

2So 

||H C (<)|| e 5(A 2 ). Applying Theorem 3.4.11(b), we have that A c (t) is e.s. for 
< A* and some A* > 0. 

Step 3. Use the properties of the C 25 norm and B-G lemma to establish sig- 
nal boundedness. Let us denote ||(-)t|| 2<5 for some S £ (0, <5o] with ||(-)||. Using 
Lemma 3.3.2 and the properties of the £ 2 < 5 -norm, we have from (9.5.35), (9.5.37) 
that for 5 < 2 

\\y P \\<\\C T e 0 \\+c\\e\\ + \\ V \\ + c 

IKII<c(PII + IM + NI) 

Combining the above inequalities, we establish that the fictitious normalizing signal 
m 2 = 1 + ||up|| 2 + \\y p \\ 2 satisfies 

m 2 < c+ c||C T e 0 || 2 + c||e|| 2 + c||? 7 || 2 

If we now apply Lemma 3.3.3 to (9.5.38), we obtain 

||e|| < c(||C T e 0 || + IMI) 


and, therefore, 


m 2 f < c + c||C T e 0 || 2 + c||77|| 2 


(9.5.40) 
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To evaluate the term ||C T e 0 || in (9.5.40), we use (9.5.34) to write 

-r s „ s+l~s (A — 9)s + A — 25 

c e " = “ (JTWT+i ’ 1 ' + iJTW ” 

Applying the Swapping Lemma A.l and using the fact that sei = sy pi we have 

c ' Te ° = 5 (7TW p ~*(7T^“'' (9 ’ 5 ’ 41) 

+W a (Wne 1 )i - W C 2 {W„u p )'b+ (A ~ ~ 25) , ; 

where W C j(s), i4'7„(.s) are strictly proper transfer functions with poles at —5. The 
first two terms on the right side of (9.5.41) can be related to the normalized esti- 
mation error. Using z = 6* T <p + V we express equation (9.5.29) as 

an 2 = 9* T + r] — 9p <f> 

= a—*— rU P - b— -Up + rj (9.5.42) 

5 t A S t A 

We now filter each side of (9.5.42) with and apply the Swapping Lemma A.l 

to obtain 

tw” = a (7+W’ Jr - <JTW" r ( > 

+ W c ( I i + + A b 

where W c , IT), are strictly proper transfer functions with poles at —5. Using (9.5.43) 
in (9.5.41) we obtain 

C T e 0 = ^ + + 5 ^ em ' 2 - W c {(W b y p )h - {W b u p )b] 

l s 2 + 9s + 25 — A 

+W c i(W bl e 1 )a - W c2 (W b 2 U p )b - ‘ + 17 

Using the fact that W(s)y p , W(s)u p are bounded from above by m/ for any strictly 
proper W(s) that is analytic in i?e[s] > — | and m < mj, we apply Lemma 3.3.2 
to obtain 

||C T e 0 || < c\\emmf\\ + c||om/|| + c|| 6 m/|| + c||jy|| (9.5.44) 

which together with (9.5.40) imply that 


m 2 f < c + c|| 3 ?n/|| 2 + c||? 7|| 2 
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where g 2 = |a| 2 + |6| 2 + |em| 2 . Since 



m f 


bA m (s) 
s T A 


= A 


OO 


00(5 


we have 

m 2 f < c + c|| < 7777 ./ 1| 2 + cA^m 2 

Therefore, for 

< 1 

we have ^ 

mj < c + c\\gmf \\ 2 = c + c ( e~ s ^~ T ^ g 2 (T)m 2 (t)cIt 

J o 

Applying B-G Lemma III we obtain 

S 2 CUr + cS f* e -5(t-s) e cf s 9 2 (T)dT ds 
JO 

Because c f* g 2 (r)dT < cA 2 (f — s) + c, it follows that for 

cA^ < S 

we have m/ £ Coo- The boundedness of all the signals is established as follows: 
Because m < to/, ||7/|| < AqoTO/, \r/\ < A 2 to, we have to, ||??||,?? € Coo- Applying 
Lemma 3.3.3 to (9.5.38) and (9.5.34), we obtain 

p||,|e(i)| < c||C' T e 0 || + c||ry|| 

poll, |e c (t)| < cp|| + c|b?|| + cpill (9.5.45) 

Pill < H^eoll + c||e|| + || 7 7 || 


r 2 < ce St e 


f 

JO 


From e,m,mf,\\r]\\,a,b € Coo , it follows from (9.5.44) that ||C T e 0 || £ Coo and 
therefore using the above inequalities, we have ||e||, |e|, ||e 0 ||, |e G |, ||ei|| £ Coo- From 
(9.5.35), (9.5.37) and ij £ Coo, it follows that y p , u p £ Coo which together with 
e = e Q + e and e Q , e £ Coo, we can conclude that all signals are bounded. 

The conditions on A m (s) for robust stability are summarized as follows: 


cA 2 < min[cA*, VS\, cA^ < 1 


A 2 


bA m (s) 
s T A 


2 Sn 


A, 


bA m (s) 
s T A 


006 


where A* is a bound for the stability of A c (t) and 0 < 6 < So is a measure of the 
stability margin of A c (t) and c > 0 represents finite constants that do not affect the 
qualitative nature of the above bounds. 
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Step 4. Establish tracking error bounds. We have, from (9.5.35) that 

ei = e 0 + C T e + ? 7 

Because (9.5.44), (9.5.45) hold for 6 = 0, it follows that 


Mil = 


e?(r)dr<||(C T e 0 ) t ||^ + c||e t ||2 + ||r, t ||2 
Jo 

< c\\{9mf) t \\l + c\\rit\\l 


Because rrif G £oo 7 ^ < A2 and \\gtW 2 = f 0 g 2 ( r )dr < A%t + c, we have 

J e\dr < cA\t + c 
and the stability analysis is complete. 


V 


Let us now extend the results of Example 9.5.2 to the general scheme 
presented in Table 9.6. 

Theorem 9.5.3 Assume that R p (s,t)Q m (s), Z p (s,t) are strongly coprime 
and consider the APPC scheme of Table 9.6 that is designed for the plant 
model but applied to the actual plant with A m , d u / 0. There exists constants 
d*, > 0 such that for all A m (s) satisfying 

A 2 2 + f 0 <6*, Aoo^A^ 


where 


A A 

Aoo = 


Z p {s) 


A (s) 


A m (s) 


A A 

A2 = 


00(5 


Z p {s) 


A (s) 


A m (s) 


2<5q 


and A(s) is an arbitrary polynomial of degree n with roots in Refs] < — nf, all 
signals in the closed-loop plant are bounded and the tracking error satisfies 

[ ej(r) dr < c( + dg + fo)t + c 
Jo 

where /o = 0 in the case of switching a and projection and /o > 0 in the 
case of fixeda (fo = cr), e-modification (fa = vq), and dead zone (fa = go)- 

The proof for the input disturbance free (d u = 0) case or the case where — 
is sufficiently small follows from the proof in the ideal case and in Exam- 
ple 9.5.2. The proof for the general case where — is not necessarily small 
involves some additional arguments and the use of the C 28 norm over an 
arbitrary time interval and is presented in Section 9.9.2. 
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9.5.5 Robust LQ Adaptive Control 

Robust adaptive LQ control (ALQC) schemes can be formed by combining 
the LQ control law (9.5.5), where K c is calculated from the Riccati Equation 
(9.5.7), with robust adaptive laws as shown in Table 9.7. 

The ALQC is exactly the same as the one considered in Section 9.5.4 
and shown in Table 9.6 with the exception that the controller gain matrix 
K c is calculated using the algebraic Riccati equation at each time t instead 
of the pole placement equation of Table 9.6. 

The stability properties of the ALQC scheme of Table 9.7 applied to the 
actual plant are the same as those of APPC scheme based on state feedback 
and are summarized by the following theorem. 


Theorem 9.5.4 Assume that R p (s,t)Q m (s), Z p (s,t) o,re strongly coprime. 
There exists constants (5*, A^ > 0 such that for all A m (s) satisfying 

A2 + /o<<P, Aoo^A^ 

where 


A A 

A™ = 


Z p {s) 


A (s) 


A m (s) 


A A 

Ao = 


00(5 


Z p (s) 


A (s) 


A m(s) 


2<5q 


all signals are bounded and the tracking error e\ satisfies 
f e\{r)dT < c(A% + d% + f 0 )t + c 

Jo 

where /o,A(s) are as defined in Theorem 9.5.3. 


Proof The proof is almost identical to that of Theorem 9.5.3. The only difference 
is that the feedback gain K c is calculated using a different equation. Therefore, if 
we can show that the feedback gain calculated from the Riccati equation guaran- 

V 2 

tees that ||A c (f)|| € <S (^5 + /o)> then the rest of the proof can be completed by 
following exactly the same steps as in the proof of Theorem 9.5.3. The proof for 

A 2 

|| A c (t) || e5(^ 2 + fo) is established by using the same steps as in the proof of Theo- 
rem 7.4.3 to show that ||A c (f)|| < c||A(t)||+c||R(<)||. Because ||A(t)||, ||f?(t)|| < c\9 p \ 
and 9 p € S(fj^ + fo) we have \\A c (t)\\ € S(^ + fo)- D 
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Table 9.7 Robust adaptive linear quadratic control scheme 


Actual plant 

Vp ~ R P (s) (1 Am){ u p + d u ) 

Plant model 

dp ~ R p (s) u P 

Reference signal 

Qm(s)y m = 0 

Assumptions 

Same as in Table 9.5 

Robust 
adaptive law 

Same as in Table 9.5 to generate Z p (s,t), R p (s,t) 
and A, B 

State observer 

Same as in Table 9.6 

Calculation 

of controller 
parameters 

K c = A ~ 1 B t P 

A t p + pA- \pbb t p + cc T = o 

C T = [1,0,...,0],P = P T > 0 

Control law 

Up = — K c {t)e , Up = ij^Rp 

Design variables 

A > 0 and Q\. Q m as in Table 9.6 


Example 9.5.3 Consider the same plant as in Example 9.5.2, i.e. , 

Up = : (1 T A m (s))rip 

s + a 

and the same control objective that requires the output y p to track the constant 
reference signal y m = 1. A robust ALQC scheme can be constructed using Table 9.7 
as follows: 

Adaptive Law 


p 


Pr[T(e0- a s 9p)\ 
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Z Op 4* 2 , 2 2 

e = ~ — , to =l + n s , n=m s 

m z 

m s = -S 0 m s + Mp + 2/p, to s (0) = 0 

2 = JTA 14 " ^ = 

d p = [6, a] T 

The projection operator Pr[-] constraints b to satisfy \b(t)\ > b 0 Vt > 0 where 
6 0 is a known lower bound for 6 and a s is the switching a- modification. 

State Observer 


where 


e = Ae + Bu p — K 0 ([ 1 0]e — ei) 


—a 1 


' 1 ' 


' 10 - a ' 

0 0 

, B = b 

1 

, Ko = 

25 


Controller Parameters K c = X 1 B T P where P = P T > 0 is solved pointwise 
in time using 

A T P + PA - PB\~ l B 1 P + CC r = O, C T = [ 1 0] 


Control Law 


= - K c (t)e , 


u v = 


s + 1 . 


V 


9.6 Adaptive Control of LTV Plants 

One of the main reasons for considering adaptive control in applications is to 
compensate for large variations in the plant parameters. One can argue that 
if the plant model is LTI with unknown parameters a sufficient number of 
tests can be performed off-line to calculate these parameters with sufficient 
accuracy and therefore, there is no need for adaptive control when the plant 
model is LTI. One can also go further and argue that if some nominal values 
of the plant model parameters are known, robust control may be adequate 
as long as perturbations around these nominal values remain within certain 
bounds. And again in this case there is no need for adaptive control. In 
many applications, however, such as aerospace, process control, etc., LTI 
plant models may not be good approximations of the plant due to drastic 
changes in parameter values that may arise due to changes in operating 
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points, partial failure of certain components, wear and tear effects, etc. In 
such applications, linear time varying (LTV) plant models of the form 

x = A(t)x + B(t)u 

y = C T (t)x + D(t)u (9.6.1) 

where A, B, C, and D consist of unknown time varying elements, may be 
necessary. Even though adaptive control was motivated for plants modeled 
by (9.6.1), most of the work on adaptive control until the mid 80’s dealt 
with LTI plants. For some time, adaptive control for LTV plants whose 
parameters vary slowly with time was considered to be a trivial extension of 
that for LTI plants. This consideration was based on the intuitive argument 
that an adaptive system designed for an LTI plant should also work for a 
linear plant whose parameters vary slowly with time. This argument was 
later on proved to be invalid. In fact, attempts to apply adaptive control to 
simple LTV plants led to similar unsolved stability and robustness problems 
as in the case of LTI plants with modeling errors. No significant progress was 
made towards the design and analysis of adaptive controllers for LTV plants 
until the mid-1980s when some of the fundamental robustness problems of 
adaptive control for LTI plants were resolved. 

In the early attempts [4, 74], the notion of the PE property of certain 
signals in the adaptive control loop was employed to guarantee the exponen- 
tial stability of the unperturbed error system, which eventually led to the 
local stability of the closed-loop time-varying (TV) plant. Elsewhere, the 
restriction of the type of time variations of the plant parameters also led to 
the conclusion that an adaptive controller could be used in the respective 
environment. More specifically, in [31] the parameter variations were as- 
sumed to be perturbations of some nominal constant parameters, which are 
small in the norm and modeled as a martingale process with bounded covari- 
ance. A treatment of the parameter variations as small TV perturbations, 
in an /^ 2 -gain sense, was also considered in [69] for a restrictive class of LTI- 
nominal plants. Special models of parameter variations, such as exponential 
or 1/t-decaying or finite number of jump discontinuities, were considered in 
[70, 138, 181]. The main characteristic of these early studies was that no 
modifications to the adaptive laws were necessary due to either the use of 
PE or the restriction of the parameter variations to a class that introduces 
no persistent modeling error effects in the adaptive control scheme. 
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The adaptive control problem for general LTV plants was initially treated 
as a robustness problem where the effects of slow parameter variations were 
treated as unnrodeled perturbations [9, 66, 145, 222], The same robust adap- 
tive control techniques used for LTI plants in the presence of bounded distur- 
bances and unnrodeled dynamics were shown to work for LTV plants when 
their parameters are smooth and vary slowly with time or when they vary 
discontinuously, i.e. , experience large jumps in their values but the disconti- 
nuities occur over large intervals of time [225]. 

The robust MRAC and APPC schemes presented in the previous sections 
can be shown to work well with LTV plants whose parameters belong to the 
class of smooth and slowly varying with respect to time or the class with 
infrequent jumps in their values. The difficulty in analyzing these schemes 
with LTV plants has to do with the representations of the plant model. In 
the LTI case the transfer function and related input /output results are used 
to design and analyze adaptive controllers. For an LTV plant, the notion 
of a transfer function and of poles and zeros is no longer applicable which 
makes it difficult to extend the results of the LTI case to the LTV one. This 
difficulty was circumvented in [223, 224, 225, 226] by using the notion of 
the polynomial differential operator and the polynomial integral operator to 
describe an LTV plant such as (9.6.1) in an input-output form that resembles 
a transfer function description. 

The details of these mathematical preliminaries as well as the design 
and analysis of adaptive controllers for LTV plants of the form (9.6.1) are 
presented in a monograph [226] and in a series of papers [223, 224, 225]. 
Interested readers are referred to these papers for further information. 

9.7 Adaptive Control for Multivariable Plants 

The design of adaptive controllers for MIMO plant models is more complex 
than in the SISO case. In the MIMO case we are no longer dealing with a 
single transfer function but with a transfer matrix whose elements are trans- 
fer functions describing the coupling between inputs and outputs. As in the 
SISO case, the design of an adaptive controller for a MIMO plant can be 
accomplished by combining a control law, that meets the control objective 
when the plant parameters are known, with an adaptive law that generates 
estimates for the unknown parameters. The design of the control and adap- 
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tive law, however, requires the development of certain parameterizations of 
the plant model that are more complex than those in the SISO case. 

In this section we briefly describe several approaches that can be used to 
design adaptive controllers for MIMO plants. 

9.7.1 Decentralized Adaptive Control 

Let us consider the MIMO plant model 

y = H(s)u (9.7.1) 

where y £ 7 Z N ,u £ IZ N and H(s) £ C NxN is the plant transfer matrix that 
is assumed to be proper. Equation (9.7.1) may be also expressed as 

Vi = ha{s)ui + ^ hij(s)uj , i = 1, 2, . . . , N (9.7.2) 

l<j<N 

where hij(s), the elements of H(s), are transfer functions. 

Another less obvious but more general decomposition of (9.7.1) is 

yi = hu(s)ui+ ^ (hij(s)uj + qij(s)yj) , i = l,2,...,N (9.7.3) 

1 <j<N 

for some different transfer functions hij(s),qij(s). 

If the MIMO plant model (9.7.3) is such that the interconnecting or 
coupling transfer functions hij(s),qij(s ) (i / j) are stable and small in some 
sense, then they can be treated as modeling error terms in the control design. 
This means that instead of designing an adaptive controller for the MIMO 
plant (9.7.3), we can design N adaptive controllers for N SISO plant models 
of the form 

yi = hu(s)ui , i = 1,2, . . . , N (9.7.4) 

If these adaptive controllers are designed based on robustness considerations, 
then the effect of the small unmodeled interconnections present in the MIMO 
plant will not destroy stability. This approach, known as decentralized adap- 
tive control , has been pursued in [38, 59, 82, 185, 195]. 

The analysis of decentralized adaptive control designed for the plant 
model (9.7.4) but applied to (9.7.3) follows directly from that of robust adap- 
tive control for plants with unmodeled dynamics considered in the previous 
sections and is left as an exercise for the reader. 
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9.7.2 The Command Generator Tracker Approach 

The command generator tracker (CGT) theory was first proposed in [26] for 
the model following problem with known parameters. An adaptive control 
algorithm based on the CGT method was subsequently developed in [207, 
208]. Extensions and further improvements of the adaptive CGT algorithms 
for finite [18, 100] as well as for infinite [233, 234] dimensional plant models 
followed the work of [207, 208]. 

The adaptive CGT algorithms are based on a plant/reference model 
structural matching and on an SPR condition. They are developed using 
the SPR-Lyapunov design approach. 

The plant model under consideration in the CGT approach is in the state 
space form 


x = Ax + Bu , x(0) = xo 

y = C T x (9.7.5) 

where x £ 77 n ; y, u £ 77 m ; and A, B , and C are constant matrices of appro- 
priate dimensions. The control objective is to choose u so that all signals in 
the closed-loop plant are bounded and the plant output y tracks the output 
y rn of the reference model described by 

%m = A m Xm T B m r x m (0) = X m Q 

y m = C^Xm (9.7.6) 

where x m £ IZ nm ,r £ IZ Pm and y m £ 7 Z m . 

The only requirement on the reference model at this point is that its 
output y m has the same dimension as the plant output. The dimension of 
x m can be much lower than that of x. 

Let us first consider the case where the plant parameters A,R, and C 
are known and use the following assumption. 

Assumption 1 (CGT matching condition) There exist matrices S'Jj , 
S' ( 2 , iS’li, S '22 such that the desired plant input u* that meets the control 
objective satisfies 


X* 


' St 1 

Q* 

°12 



u* 


Q* 

°21 

c* 

°22 


r 


(9.7.7) 
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where x* is equal to x when u = u* , i.e. , x* satisfies 

x* = Ax* + Bu* 
y = c x = y rn 

Assumption 2 (Output stabilizability condition) There exists a ma- 
trix G* such that A + BG*C T is a stable matrix. 

If assumptions 1, 2 are satisfied we can use the control law 

u = G*(y - y m ) + S 21 x m + S 22 r (9.7.8) 

that yields the closed-loop plant 

x = Ax + BG*e i + BS* 21 x m + BS 22 r 

where e\ = y — ym- Let e = x — x* be the state error between the actual and 
desired plant state. Note the e is not available for measurement and is used 
only for analysis. The error e satisfies the equation 

e = (A + BG*C T )e 

which implies that e and therefore ei are bounded and converge to zero 
exponentially fast. If x m , r are also bounded then we can conclude that all 
signals are bounded and the control law (9.7.8) meets the control objective 
exactly. 

If A,B, and C are unknown, the matrices G*,S 21 , and S 22 cannot be 
calculated, and, therefore, (9.7.8) cannot be implemented. In this case we 
use the control law 

u = G(t)(y - y m ) + S 2 i{t)x m + S 2 2 {t)r (9.7.9) 

where G(t), S 2 i(t), S 2 2 {t) are the estimates of G*, S 2l , S 22 to be generated 
by an adaptive law. The adaptive law is developed using the SPR-Lyapunov 
design approach and employs the following assumption. 

Assumption 3 (SPR condition) There exists a matrix G* such that 
C T (si — A — BG*C t )~ 1 B is an SPR transfer matrix. 
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Assumption 3 puts a stronger condition on the output feedback gain 
G* , i.e. , in addition to making A + BG*C stable it should also make the 
closed-loop plant transfer matrix SPR. 

For the closed-loop plant (9.7.5), (9.7.9), the equation for e = x — x* 
becomes 


e = {A + BG*C T )e + BGeiP BSu 
ei = C T e (9.7.10) 

where 

G(t) = G(t) - G*, S{t) = S{t)-S* 

S(t) = [S 21 (t),S 22 (t)}, S* = [S* 21 ,S % 2 ] 
w = i x mH T ] T 

We propose the Lyapunov-like function 

V = e T P c e + tr[G T T^G] + tr[S T T 2 1 5] 

where P c = Pj > 0 satisfies the equations of the matrix form of the LKY 
Lemma (see Lemma 3.5.4) because of assumption 3, and ri,r 2 are symmet- 
ric positive definite matrices. 

Choosing 

G = G = -T iei eJ 

S = S = - P 2 e 1 uj T (9.7.11) 

it follows as in the SISO case that 

V = —e T QQ T e — t' c e T L c e 

where L c = Lj > 0 and v c > 0 is a small constant and Q is a constant matrix. 
Using similar arguments as in the SISO case we can establish that e, G, S 
are bounded and e € C 2 . If in addition x m ,r are bounded we can establish 
that all signals are bounded and e, ei converge to zero as i — ► oo. Therefore 
the adaptive control law (9.7.9), (9.7.11) meets the control objective. 

Due to the restrictive nature of Assumptions 1 to 3, the CGT approach 
did not receive as much attention as other adaptive control methods. As 
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shown in [233, 234], the CGT matching condition puts strong conditions on 
the plant and model that are difficult to satisfy when the reference signal r is 
allowed to be bounded but otherwise arbitrary. If r is restricted to be equal 
to a constant these conditions become weaker and easier to satisfy. The class 
of plants that becomes SPR by output feedback is also very restrictive. In 
an effort to expand this class of plants and therefore relax assumption 3, 
the plant is augmented with parallel dynamics [18, 100], i.e. , the augmented 
plant is described by 


y a = [C T {sI-A)- 1 B + W a {s)]u 

where W a (s) represents the parallel dynamics. If W a (s) is chosen so that 
assumptions 1, 2, 3 are satisfied by the augmented plant, then we can estab- 
lish signal boundedness and convergence of y a to y rn . Because y a / y the 
convergence of the true tracking error to zero cannot be guaranteed. 

One of the advantages of the adaptive control schemes based on the CGT 
approach is that they are simple to design and analyze. 

The robustness of the adaptive CGT based schemes with respect to 
bounded disturbances and unmodeled dynamics can be established using the 
same modifications and analysis as in the case of MRAC with unnormalized 
adaptive laws and is left as an exercise for the reader. 

9.7.3 Multivariable MRAC 

In this section we discuss the extension of some of the MRAC schemes for 
SISO plants developed in Chapter 6 to the MIMO plant model 

y = G(s)u (9.7.12) 

where y £ 7 Z N , u £ 1Z N and G{s) is an N x N transfer matrix. The reference 
model to be matched by the closed-loop plant is given by 

y m = W m (s)r (9.7.13) 

where y m ,r £ IZ N . Because G(s) is not a scalar transfer function, the 
multivariable counterparts of high frequence gain, relative degree, zeros and 
order need to be developed. The following Lemma is used to define the 
counterpart of the high frequency gain and relative degree for MIMO plants. 
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Lemma 9.7.1 For any Nx N strictly proper rational full rank transfer ma- 
trix G(s), there exists a (non-unique) lower triangular polynomial matrix 
fm(s), defined as the modified left interactor (MLI) matrix of G(s), of the 
form 


€m(s) 


di(s) 0 0 0 

h 2 i{s) d 2 (s) 0 ••• 0 

: 0 
hm(s) fyv(jv-i)(«) d N (s) 


where hij(s),j = 1, . . . , N — 1, i = 2, . . . , N are some polynomials and djfs), 
i = 1, . . ., N, are arbitrary monic Hurwitz polynomials of certain degree 
li > 0, such that lim^oo £ m (s)G(s) = K p , the high-frequency- gain matrix of 
G(s), is finite and nonsingular. 


For a proof of Lemma 9.7.1, see [216]. 

A similar concept to that of the left interactor matrix, which was intro- 
duced in [228] for a discrete-time plant is the modified right interactor (MRI) 
matrix used in [216] for the design of MRAC for MIMO plants. 

To meet the control objective we make the following assumptions about 
the plant: 

Al. G(s) is strictly proper, has full rank and a known MLI matrix £ m (s). 
A2. All zeros of G(s) are stable. 

A3. An upper bound uq on the observability index of G{s) is known 
[50, 51, 73, 172], 

A4. A matrix S p that satisfies K p S p = ( K p S p ) T > 0 is known. 

Furthermore we assume that the transfer matrix W m (s) of the reference 
model is designed to satisfy the following assumptions: 

Ml. All poles and zeros of W m {s) are stable. 

M2. The zero structure at infinity of W m (s) is the same as that of G(s), 
i.e. , lim. 5^00 £ m (s)W m (s) is finite and nonsingular. Without loss of 
generality we can choose W m (s) = 
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Assumptions A1 to A4, and Ml and M2 are the extensions of the as- 
sumptions PI to P4, and Ml and M2 in the SISO case to the MIMO one. 
The knowledge of the interactor matrix is equivalent to the knowledge of 
the relative degree in the SISO case. Similarly the knowledge of S p in A4 is 
equivalent to the knowledge of the sign of the high frequency gain. 

The a priori knowledge about G(s) which is necessary for £ m (s) to be 
known is very crucial in adaptive control because the parameters of G(s) 
are considered to be unknown. It can be verified that if £ m (s) is diagonal 
and the relative degree of each element of G(s) is known then £ m (s) can be 
completely specified without any apriori knowledge about the parameters of 
G(.s). When £ m (s) is not diagonal, it is shown in [172, 205] that a diag- 
onal stable dynamic pre-compensator W p (s) can be found, using only the 
knowledge of the relative degree for each element of G(s), such that for most 
cases G(s)W r p (s) has a diagonal £ m (s). In another approach shown in [216] 
one could check both the MLI and MRI matrices and choose the one that is 
diagonal, if any, before proceeding with the search for a compensator. The 
design of MRAC using the MRI is very similar to that using the MLI that 
is presented below. 

We can design a MRAC scheme for the plant (9.7.12) by using the cer- 
tainty equivalence approach as we did in the SISO case. We start by as- 
suming that all the parameters of G(s) are known and propose the control 
law 

u = efu! + 6* 2 t lo 2 + e* 3 r = e* T uj ( 9 . 7 . 14 ) 

where d* T = [ 0 * T , 6 2 T , # 3 ], u = [uj ,r T ] T . 


u 1 


A(s) A(s) 

T7T«, ^2 = TTTl/ 

A(.s A(s) 


A{s) = [Is^-\Is^~\...,IsJ\ t 

/)* T/D* n* lT n* 173* /)* ]T 

“1 ~ rii? • • • ? ’ ^2 — [“21? • • • 5 u 2n 0 \ 

9ie*j€n NxN , i = 1 , 2 , j = i,2 ,...,p 0 

and A(s) is a rnonic Hurwitz polynomial of degree Hq. 

It can be shown [216] that the closed- loop plant transfer matrix from y 
to r is equal to W rn (s) provided 6$ = K^ 1 and 9\,62 are chosen to satisfy 
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the matching equation 


1-91 


*T 


M*) 

A (8) 


_ 0*T A(s) 


A (s' 


G(s) = 9* 3 W~ 1 (s)G(s 


(9.7.15) 


The same approach as in the SISO case can be used to show that the control 
law (9.7.14) with 9*,i = 1,2,3 as chosen above guarantees that all closed- 
loop signals are bounded and the elements of the tracking error e\ = y — y m 
converge to zero exponentially fast. 

Because the parameters of G(s) are unknown, instead of (9.7.14) we use 
the control law 

u = 9 t (t)u (9.7.16) 

where 9{t) is the on-line estimate of the matrix 9* to be generated by an 
adaptive law. The adaptive law is developed by first obtaining an appro- 
priate parametric model for 9* and then using a similar procedure as in 
Chapter 4 to design the adaptive law. 

From the plant and matching equations (9.7.12), (9.7.15) we obtain 


u-et T u 1 -efu 2 = 9tw- 1 (s)y 

9*f\u - 9 * t uj) = W~\s)(y - y m ) = £ m (a)ei (9.7.17) 

Let d m be the maximum degree of £ m (s) and choose a Hurwitz polynomial 
f(s) of degree d m . Filtering each side of (9.7.17) with 1 / f{s) we obtain 

z = i/,*[0* T <j> + z 0 ] (9.7.18) 


where 


^/Tl(^) / s|s — 1 jr 

z = — -—-e 1 , Ip =9 3 = Kp 

m 


f(s. 


-V, Zq = 


f(s. 


-u 


which is the multivariable version of the bilinear parametric model for SISO 
plants considered in Chapter 4. 

Following the procedure of Chapter 4 we generate the estimated value 


Z = 1p{t)[9 T (t)(p + Zq] 
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of z and the normalized estimation error 

z — z 


We can verify that 

_ ipi + ip*6 T (p 
m 2 

where £ = zq + 9 T cp and m? could be chosen as m? = 1 + |£| 2 + \<p\ 2 . The 
adaptive law for 9 can now be developed by using the Lyapunov-like function 

V = ^tr[9 T T p 9} + l -tr[^T-^} 

where r p = Kj S~ l = (5“ 1 ) T (/i p S' p ) T S' p 1 and T = T t > 0. If we choose 

9 t = -S p ej) T , = — Te£ T (9.7.19) 

it follows that 

V = — e T cm 2 < 0 

which implies that 9, ip are bounded and |em| G C- 2 - 

The stability properties of the MRAC (9.7.12), (9.7.16), (9.7.19) are sim- 
ilar to those in the SISO case and can be established following exactly the 
same procedure as in the SISO case. The reader is referred to [216] for the 
stability proofs and properties of (9.7.12), (9.7.16), (9.7.19). 

The multivariable MRAC scheme (9.7.12), (9.7.16), (9.7.19) can be made 
robust by choosing the normalizing signal as 

m 2 = 1 + m s , m s = -6 0 m s + |w| 2 + |y| 2 , m s ( 0) = 0 (9.7.20) 

where So > 0 is designed as in the SISO case and by modifying the adaptive 
laws (9.7.19) using exactly the same techniques as in Chapter 8 for the SISO 
case. 

For further information on the design and analysis of adaptive controllers 
for MIMO plants, the reader is referred to [43, 50, 51, 73, 151, 154, 172, 201, 
205, 213, 214, 216, 218, 228]. 
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9.8 Stability Proofs of Robust MRAC Schemes 


9.8.1 Properties of Fictitious Normalizing Signal 

As in Chapter 6, the stability analysis of the robust MRAC schemes involves the 
use of the £28 norm and its properties as well as the properties of the signal 

m? = l + |K|| 2 +||y P || 2 (9.8.1) 

where ||(-)|| denotes the £ 2 s norm || (•)* II25- The signal to/ has the property of 
bounding from above almost all the signals in the closed-loop plant. In this, book, 
to/ is used for analysis only and is referred to as the fictitious normalizing signal, to 
be distinguished from the normalizing signal to used in the adaptive law. For the 
MRAC law presented in Table 9.1, the properties of the signal to./ are given by the 
following Lemma. These properties are independent of the adaptive law employed. 


Lemma 9.8.1 Consider the MRAC scheme of Table 9.1. For any given 6 € (0, <5o] , 
the signal to/ given by (9.8.1) guarantees that: 


(i) 


mf 


— , * = 1, 2 and — 

m f 7 7 m f 


£ £ c 


(h) if esc. 


then Ail j 

m f 7 m f 7 rrif 7 


W(s 


e £°° w here W(s) is any proper 


f ' m f 

transfer function that is analytic in i?e[s] > — <5 q / 2 . 


(iii) If 9,r £ £oo, then ^ e £ 


(iv) For 6 = <5o, (i) to (iii) are satisfied by replacing to/ with in = \Jl + ri%, where 
n 2 s = m s and m s = — 6 0 m s + u 2 + y 2 , to s (0) = 0. 


Proof (i) We have 

a(s) a(s) 

Wl = A (sj Up ’ W 2 = A ( s) Vp 

Because the elements of are strictly proper and analytic in Re[s] > — So/2, it 

follows from Lemma 3.3.2 that £ £ooA = 1,2 for any S £ (0, <5o] - Now 

||w|| < IMI + 1 1 UO 2 1 1 + Hj/pll + c < cmf + c and therefore £ C^. Because 

n 2 = m s = \\u pt \\l So + hptWlso and ll(')tl|25o < 1 1 ( ") * II 2<5 for any <5 G (0, <5 0 ] , it follows 
that n s /mf G £oq. 

(ii) From equation (9.3.63) and u p = 9 T u> we have 

y P = W m (s)p*e T u + p*ij + W m (s)r 

where 77 = A ^ a W m [ A m ( u p + d u ) + d u \. Because W m , W m A m are strictly proper 
and analytic in Re[s] > —So/ 2, and 9 G £ 00 , it follows that for any S £ (0, do] 


\y P (t)\ < c||<x>|| + c||7ip|| + C < CTO/ + C 
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i.e., Vp/rrif £ Coo. Since to = [ooj , loJ , y p , r] T and r,u>i/nif,i = 1,2 and y p /mf £ 
Coo, it follows that co/mj £ Coo- From u p = 9 T co and 9, ^ £ C^, we 

have e Coo ■ We have \W(s)u>\ < || W 0 (s) || 25 Ill’ll + \d\\\u\\ < cm f where 

Wo(s) + d = W(s) and, therefore, AMP g £ oo _ The signal y p is given by 

rrif y 

y P = sW m (s)r + sW m (s)p* 9 t lo + p*rj 
Because sW m (s) is at most biproper we have 

||2/p|| < c||sW m (s)|| oo5 ||w|| + c||j 7|| + c 


where 


< 


A - 6^ T a(s) 


A(s) 

< cm f + cdo 


slb m (s)A m (s) 


cdn 


00(5 


which together with £ Coo imply that £ Coo- 

(iii) We have to = [ Xff u p > Tpo Vp^VpHV ■ Because the elements of ^x(sY are 
proper and analytic in Re[s] > —Sq/2 and r, £ Coo, it follows that ^dl g £, oo _ 

(iv) For 6 = (5 0 we have mj = 1 + ||upt||| 5o + WUptWlso' Because m 2 = l + 

\\u P t\\is 0 + 1 1 2/pt | ll^o = TO /> the P r0 °f °f (i y ) follows. □ 


Lemma 9.8.2 Consider the systems 

v = H(s)v p (9.8.2) 

and 

rn s = -6 0 m s + \u p \ 2 + \y p \ 2 , m s ( 0) = 0 

m 2 = 1 + m s 

where So > 0, 1 1 Vp* 1 1 25 0 — cm (t) Vt > 0 and some constant c > 0 and H(s) is a 
proper transfer function with stable poles. 

Assume that either || || 2<s 0 — cm (t) Vt > 0 for some constant c > 0 or H(s) is 
analytic in Re[s] > —So/2. Then there exists a constant 6 > 0 such that 

(i) IKtJI < ce-i^-tAmih) + ||tf(s)|| 00 , 5 ||'t;p t>ti ||, \/t>t i>0 

(ii) If H(s) is strictly proper, then 

K*)l < + ||^(s)|| 2 «||vp t , t J|, Vt > h > 0 

where || (•)*,*! II denotes the C 26 norm over the interval [ti,t) 
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Proof Let us represent (9.8.2) in the minimal state space form 


x = Ax + Blip, a:(0) = 0 

v = C T x + Dv p (9.8.3) 

Because A is a stable matrix, we have ||e j4 ( t_T )|| < Aoe _ “°^^ r ^ for some Ao, ao > 0. 
Applying Lemma 3.3.5 to (9.8.3) we obtain 

IKtJ < ce-^-^\x( tl )\ + ||77(s)||oo(5|K , pt,t 1 II (9-8.4) 

and for H (s) strictly proper 

K*)l < + II^WIMK,, II (9-8.5) 

for any 0 < <5 < 2a 0 and for some constant c > 0. If H(s ) is analytic in i?e[s] > 
—5q/2 then A — ^1 is a stable matrix and from Lemma 3.3.3 we have 

\x(t)\ < c\\v p t\\ 25 0 < cm(t) Vf > 0 


Hence, |x(fi)| < cm(t±), (i) and (ii) follow from (9.8.4) and (9.8.5), respectively. 

When H(s) is not analytic in R,e[.s\ > —Sq/2 we use output injection to rewrite 
(9.8.3) as 

x = {A — KC t )x + Bv p + KC T x 


or 

x = A c x + B c Vp + Kv (9.8.6) 

where A c = A — KC T , B c = B — KD and K is chosen so that A c — 4y/ is a stable 
matrix. The existence of such a AT is guaranteed by the observability of [C, A) [95]. 
Applying Lemma 3.3.3 (i) to (9.8.6), we obtain 

|ar(£)| < c||up t || 2 5 0 + c||u t || 2 5 0 < cm(t) \/t > 0 (9.8.7) 

where the last inequality is established by using the assumption ||ut|| 2 a 0) ||u pt || 2( s 0 < 
emit) of the Lemma. Hence, ||x(£i)|| < cm(ti) and (i), (ii) follow directly from 
(9.8.4), (9.8.5). □ 

Instead of to/ given by (9.8.1), let us consider the signal 

m /i (*) = e _< fo _tl) TO 2 (fi) + ||u Pt _ ti || 2 + || y Pttti || 2 , t>t i > 0 (9.8.8) 

The signal to / , has very similar normalizing properties as mf as indicated by the 
following lemma: 

Lemma 9.8.3 The signal mf 1 given by (9.8.8) guarantees that 
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(i) Ui(t)/mf lt i = 1,2; ||u; t)tl W/m^ and n s /m fl G C^ 

(ii) If 8 G Coo, theny p /m fl , u p /m fl , u/m fl , W(s)io/m fl G C ^ and \\u PuH \\/m fl , 

II Vpt,t 1 ll/ m /i ^ ^oo 

(iii) If 8,f G £oo, then \\Cot,tA\/ rn f 1 e £°o 

where ||(-)t,ti|| denotes the C- 25 -norm defined over the interval [ti,t],t >t i > 0; <5 is 
any constant in the interval (0, <5o] and W (s) is a proper transfer function, which is 
analytic in Re[s] > —5q/2. 

Proof (i) We have to i = ^f_u p ,t 02 = y p . Since each element of is strictly 

proper and analytic in Re[s] > — and 1 1 1 1 2 < 5 0 , 1 1 J/pt 1 1 2 < 5 0 < cm, it follows from 

Lemma 9.8.2 that 

Mt)|, lki Ml II < ce"5 (t - 4l) ?n(ti) + c\\u PtrH || < cm fl 

|w 2 (t)|, ||w 2titi || < ce"5( t -*i) TO (t 1 ) + c||y Pt ti || < cm h 
and, therefore, |wj(t)|, ||cCi t t ||,i = 1, 2 are bounded from above by m^. Because 

IKg || 2 < || 2 + K Mi || 2 + || y PtM || 2 + c < cml 

therefore, Hu^tJI is bounded from above by mf 1 . We have n 2 s = m s = m 2 — 1 and 

n 2 = m s (t) = e- So(t ~ tl \m 2 (t 1 ) -l) + \\u Pt t jl So + \\y Pt t jl So 
< e _A ( t_tl) m 2 (f i) + ||w Pt ti || 2 + \\y Pt ti || 2 = m\ 

for any given S G (0, Jo], and the proof of (i) is complete. 

(ii) We have 

y P = W m {s)p*8 T to + p*y + W m (s)r 

Because W m (s) is strictly proper and analytic in f?e[s] > — Jo/2, and Lemma 9.8.1 
together with 9 G £oo imply that \\p*8 T to t \\ 25 0 < cm(t) \/t > 0, it follows from 
Lemma 9.8.2 that 

\y P (t)\, hpt.tj < ce-^-^mfh) + c|K,tJ + c||% )tl || + c, Vf > h > 0 

Now rj = A (s)u p + d v where A(s) is strictly proper and a nalytic in f?e[s] > — 
d v G £oo and ||u Pt || 2( s 0 < m(t). Hence from Lemma 9.8.2 we have 

11%,*! II < ce~l (t_tl) m(ti) + c||u Pt _ ti || + c 

Because u p = 9 T to and 9 G Coo, we have ||u Pt || < clK.tJI, and from part (i) we 
have |K tl || < cm fl . Therefore ||u PMi || < cm/,, ||?7 t)tl || < ce~^ t ~ tl '>m{t 1 )+cmf 1 +c 
and 

\y P (i) I, I \v Pt , H II < ce _ 3 (t_tl) m(ti) + cm fl + c< cm h 
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In a similar manner, we show ||jr Pt | < cmf 1 . From |w(f)| < |wi(f)| + |a> 2 (t)| + 
\y p (t)\ + c, it follows that |w(i)| < cmf 1 . Because u p = 9 T co and 9 G Coo, it follows 
directly that |u p (f)| < cmf 1 . 

Consider v = W(s)lo where W{s ) is proper and analytic in i?e[s] > —5q/2. 
Because from Lemma 9.8.1 ||w ( || 2( 5 0 < cm(t), it follows from Lemma 9.8.2 that 

Ht)l < ce _ 5 (t_tl) m(fi) + c||w Ml || < cm fl 

(iii) We have Co = [d)J ,CoJ ,y p ,r] T , where Cui = Xff u p > = Tpq V p • Because 
the elements of are proper, it follows from the results of (i), (ii) that ||w,: Ml || < 
cnrif 1 ,i = 1,2 which together with r € C^ and ||2/ Pt|tl || < cto/j imply (iii). □ 


9.8.2 Proof of Theorem 9.3.2 


We complete the proof of Theorem 9.3.2 in five steps outlined in Section 9.3. 

Stepl. Express the plant input and output in terms of the parameter error term 
9 t uj. We use Figure 9.3 to express u p ,y p in terms of the parameter error 9 and 
modeling error input. We have 


G 0 Ac* 

Vp (A -Cl)- G 0 D\ 


^9 r o 

c o 


A -C{ 
c* 0 A 


m 


7/1 — A m (s)(u p d u ) T d v 


AcS 


Ur, = 


(A - Cl) - G 0 D{ [ c| 


r+ -4 9 r co 


GqD\ 


(A - Cl) - G 0 D1 


m 


where C*(s) = Of 1 a(s),D * = 0gA(s) + 02 T a(s). Using the matching equation 


GoAcq 

(A -CD- G 0 Dl 


= W„ 


we obtain 


y P 

u p 



Gg 1 W m 


4^ T A + 9y 

c 0 / 

r + 4^ T A + 


(9.8.9) 


Vu = 


9* A + 9* T a 

c*A 


W m rji, ii v = 


A -91' a 


c*A 




where 
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Let us simplify the notation by denoting ||(-) t || 2<5 with || • ||. From (9.8.9) and 
the stability of W m ,Gg 1 W m , we obtain 

IMI < c + c\\e T uj\\ + \\r] y \\, bpll < c+c\\e T u>\\ + bull (9.8.10) 

for some 5 > 0 by applying Lemma 3.3.2. Using the expressions for ij u ,7j y , we have 

IMI < A(S) ~w\ a(g) ||W / m(s)A m (s)|| 00< 5 ||n p || + cdo 

C 0 IY \ S ) OO 8 

1 1 1 1 ^qA(s) Otis') || / \ A / \ II II II 

bull— — * A / \ ||M / m(s)A m (s) lloo^bpll + cdo 

c O i H s l oo<5 

where do is the upper bound for d u and c > 0 denotes any finite constant, which 
implies that 

IMI < cAqoTO/ + cd 0 , 1 1 ? 7 „ 1 1 < cAooTO/ + cd 0 (9.8.11) 

where Aoo = ||W m (s)A m (s)|| 00 5 . From (9.8.10) and (9.8.11), it follows that the 
fictitious normalizing signal inj = 1 + bp II 2 + bp II 2 satisfies 

mj < c + c\\0 T lo\\ 2 + cA^nrj (9.8.12) 


Step 2. Use the swapping lemmas and properties of the C 2 S~norm to bound 
\\6 T u\\ from above. Using the Swapping Lemma A. 2 from Appendix A, we express 
9 t uj as 

,JT 

9 T u> = Fi(s, ao)(6 u> + 9 t oj) + F(s,ao)9 T u> (9.8.13) 

where F(s,a o) = , Fi(s, cto) = «o is an arbitrary constant to be 

chosen, k > n* and n* is the relative degree of G 0 (s) and W m (s). Using Swapping 
Lemma A.l, we have 

e T co = w _1 (s) [fl T fU(s)w + W c (W b LO T )9\ (9.8.14) 

where W(s) is any strictly proper transfer function with the property that W(s), 
W~ x (s) are analytic in Re[s] > —8o/2. The transfer matrices W c {s), W b (s) are 
strictly proper and have the same poles as W(s). Substituting for 9 t uj given by 
(9.8.14) to the right hand side of (9.8.13), we obtain 

•T 

9 r u = F 1 [9 uj + 9 T uj\ + FW- 1 [e T Wuj + W c (W b uj T )e} (9.8.15) 

where FW -1 can be made proper by choosing W(s) appropriately. 

We now use equations (9.8.13) to (9.8.15) together with the properties of the ro- 
bust adaptive laws to obtain an upper bound for ||0 T u;||. We consider each adaptive 
law of Tables 9.2 to 9.4 separately as follows: 
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Robust Adaptive Law of Table 9.2 We have 

e = e f -e f - W m Len 2 s = W m L[p*9 T (f) - - en 2 s + p*p f \ 


therefore, 

F</> = ^ + pi + en^) - 77/ (9.8.16) 

where (j) = L 0 (s)uj, L 0 (s) = L~ 1 (s)0^. Choosing W{s) = L 0 (s) in (9.8.15) we 
obtain 

•T 

e T co = F 1 [9 LU + 6 t uj} + FLq^Lou + W c {W b u T )0\ 

Substituting for 0 T cf> = 9 r L 0 lu and L 0 (s) = L _1 (s)j^-, and using (9.8.16), we 
obtain 


oj = Fi[9 lo + u 1 uj\ 


FW , 


-l • 


Ft 

h 0 


h 0 p* 


FT- 


PS, 


enz 


- 7 ?/ + W c (W b u T )P 


Substituting for £ = uf — 9 T <j> = L 0 9 T u> — 9 T L 0 4> = W c (W6uF)0, where the last 
equality is obtained by applying Swapping Lemma A.l to L 0 (s)9 t uj, we obtain 


;_T 

= Fi[9 w- 


FW~ 


h 0 p* 


°e+FLT 1 


en% 
L P* 


-VI+ ^cFF)9 


(9.8.17) 

Choosing k = n* + 1 in the expression for F(s,a o), it follows that FW~ 1 (s + ho) 
is biproper, FL^ 1 = FL S ^° is strictly proper and both are analytic in Re[s\ > 
— t>o/2. Using the properties of F,Fi given by Swapping Lemma A. 2, and noting 
that II (s+ a o) fc ^^ a tU~ 1 (s)|| 00 ^ and II (s+ l o) fe ^ -Felloe? are finite constants, we 
have 


|^i(s,ao)||oo<5 — 7 

ao 


F(s,a 0 )W-\s) 


< ca q 


\F(s,a 0 )L 0 (s) 


1 00(5 


F(s,a 0 )L(s)- 


— ca o 


(9.8.18) 


for any <5 € (0, <5o]. 

We can use (9.8.17), (9.8.18) and the properties of the £ 2 <5 norm given by 
Lemma 3.3.2 to derive the inequality 


IlFcuH 


< 


-(Ill'll + IlFchll) +cag||e|| +cag||e^|| 
a 0 

+ca k 0 \\W c (W b u; T )e\\ + WFL^v/W 


(9.8.19) 
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From Lemma 9.8.1 and 9 £ Coo, we have 

LO ||u|| W b w r n s ^ r 

7 ? 5 ^ *■'00 

rrif rrif rrif rrif 

\\W c {W b w T )d\\ < c||0m/||, ||0 T A|| < cm f , ||en^|| < ||en s m/|| 
Furthermore, 

|| J FL 0 - 1 r ?/ || = \\Frto\\ = \\FW~\W < ||F( S )W- 1 ( S )|| oo5 ||r ? || < ca k 0 \\ V \\ 

where 

A(s) — 01 T a(s) Tr7 . , 

9 — A(s) W m (s)[A m (s)(u p + d u ) + d u \ 

and 

INI < HS) ~J\ a{S) l|W / m (s)A m ( s )|| oo5 ||u p || + cd 0 

oo<5 

< cAaoirif + cdo 

Hence, 

|| FL^rjfW < ca^A^mf + cd 0 
and (9.8.19) may be rewritten as 

||d T w|| < — ||0 to/|| + —to/ + cag(||en s TO./|| + || cto / 1| + ||0m/||) + cagAooTO/ + cd 0 
ao a o 

(9.8.20) 

where c > 0 denotes any finite constant and for ease of presentation, we use the 
inequality ||e|| < ||em/|| in order to simplify the calculations. We can also express 
(9.8.20) in the compact form 

||0 T w|| < c||<?to/|| + c + a^Aooj to/ + cdo (9.8.21) 
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Hence, in (9.8.21), g G S(f 0 + Ag 2 + ^). 

Robust Adaptive Law of Table 9.3 This adaptive law follows directly from that 
of Table 9.2 by taking L _1 (s) = W m (s),L 0 (s) = W m (s) and by replacing by 
unity. We can therefore go directly to equation (9.8.17) and obtain 


2T 


0 lo = Fi(9 w + 0'u>) + 


fw ; 


-i 


■FW, 


-l 


en 


A + ^(% T )«-,, 

p p 


where 

A(s) — 01 T a(s) ... , . 

V — , , , W m (s)[A„ l (s)(u p + d u ) + d u ] 

A[s) 

The value of k in the expression for F(s, ao) can be taken as k = n* (even though 
k = n* + 1 will also work) leading to FW~ X being biproper. 

Following the same procedure as in the case of the adaptive law of Table 9.2, 
we obtain 


ll^ll 


< -(ll^ll + P' T A||) +cag|| C || +ca k 0 \\enl\\ 
«o 

+ ca k 0 \\W c (W b u T )e\\ + WFW-'vW 


which may be rewritten in the form 

||0 T w|| < c||< 7 TO/ 1| + c + Oq Aqo^ mf + cd 0 (9.8.23) 

where g 2 = + ag fc (|en s | 2 + \6\ 2 + e 2 ). Because e, en s , 9 G S(fo + ^), it follows 

° 2 

that g G <S(/o + ^ 2 ). Because 

|?y(t)| < A 2 m(t) + cd 0 


where 


A 2 — 


A(s) - 9*C a(s) 


A (s) 


W m {s)A m (s) 


2^n 


it follows that in (9.8.23) g G S(fo + A 2 + ^). 


(9.8.24) 


Robust Adaptive Law of Table 9.4 We have em 2 = z — z = —9 T (f> p — rj, i.e. , 
9 T <!> p = —rj — em 2 . We need to relate 9 T (f> p with 9 t lo. Consider the identities 


9 t 4> p = 9j4> 0 + c 0 y p , 9 t oj = 9ju) 0 + c 0 r 


where 9o = [9j , 6j, # 3 ] T , too = [^1 ,^2 ^VpV and 0o = W m (s)ui 0 . Using the above 
equations, we obtain 


0o = 9 T 0 P - c 0 y p = - em 2 - c 0 y p - rj 


(9.8.25) 
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Let us now use the Swapping Lemma A.l to write 

w m (s)d T u = ej w m (s)io 0 + c 0 y m + w c (w b u T )e 

Because 9j(f ) o = 9jW m (s)uJo, it follows from above and (9.8.25) that 

W m (s)d T co = - em 2 - c 0 y p - ry + c 0 y m + W c (W b uj T )9 
Substituting for 

y P = Vm + —W m (s)6 r u + r] y 
c o 

in (9.8.26) and using y = c$y y , where 

A(s) — #f T c>!(s) Trr ... . 

Vy — * . 7 7 W / m(s)[A m (s)('Up + d u ) + d u ] 

we obtain 

W m (s)Fu> = - em 2 - C ^W m (s)9 T L0 + W c {W b u T )9 - c 0 y v 
c o 

Because 1 + % = % and A g £ oo we have 
C 0 C 0 C ° 

W m (s)d T u) = — — em 2 + ^(WftW 1 ")^ — c 0 y y 
c 0 L J 

Rewriting (9.8.13) as 

0 T w = F x {9 u + 0 T w) + 

and substituting for W m (s)8 T u> from (9.8.27), we obtain 


(9.8.26) 


(9.8.27) 


- T — 

6 t lu = F 1 (9 u + 9 t u>) 

+ FW~ l — \-em 2 + W c (W b u T )8 - c 0 rJ (9.8.28) 

c 0 L J 

Following the same approach as with the adaptive law of Table 9.3, we obtain 


||0 T w|| < c||5TO/|| + c + OpAoc j rrif + cd 0 (9.8.29) 

where g £ S(f 0 + £s) or g £ S(f 0 + A% + fy) and g 2 = ^ + off (\en 2 s \ + \8\ 2 + e 2 ) . 

Step 3. Use the B-G Lemma to establish boundedness. The bound for ||0 T w|| 
in (9.8.21), (9.8.23), and (9.8.29) has exactly the same form for all three adaptive 
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laws given in Tables 9.2 to 9.4. Substituting for the bound of ||d T w|| in (9.8.12) we 
obtain 

m 2 f < c + c ||<? to /|| 2 + c ( — + ccj^A^, j m 2 + cd$ 

\ Q o / 

For 

C (4 + “° fcA °°) <X (9 ' 8 ' 30) 

we have 

m 2 < c 0 + c\\gmf \\ 2 
where Co depends on g?q, which may be rewritten as 

m 2 < Co + c f e~ s ‘ yt - T ' , g 2 (r)m 2 (r)dr 
Jo 

Applying the B-G Lemma III, we obtain 

~9 2 PP- + Cq< 5 f e -S(t-s) e c f B ~9 2 PUr ds 

Jo 

d 2 

Because g € S(f 0 + Af + -%) where A, = A 0 2 for the SPR-Lyapunov based adaptive 
law and A,; = Ao for the adaptive laws of Tables 9.2 and 9.4, we obtain 

e -l(t-s) e cf:^r ds (9 8 31) 

provided 

c 

c(fo + A 2 ) < - (9.8.32) 

where c in (9.8.32) is proportional to A + «n 1 ' aR d can be calculated by keeping 
track of all the constants in each step. The constant c also depends on the H^s and 
H 25 norms of the transfer functions involved and the upper bound for the estimated 
parameters. 

To establish the boundedness of m/, we have to show that < 5 f° r all 

t > 0 or for most of the time. The boundedness of m.f will follow directly if we 
modify the normalizing signal as mr = 1 + n 2 ,n 2 = /3o + m s and choose the constant 
Po large enough so that 

cdp < cdg < 6 
m 2 (t) ~ Po ~ 2 

Vt > 0. This means that m is always larger than the level of the disturbance. Such a 
large m will slow down the speed of adaptation and may in fact improve robustness. 
A slow adaptation, however, may have an adverse effect on performance. 

The boundedness of signals can be established, however, without having to 
modify the normalizing signal by using the properties of the C- 2 & norm defined over 


2 / — c 
nif < ce 2 e 


2 M 


-dr 


CoS 


m 2 < ce 5 t e c 
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an arbitrary interval [t\,t] given by Lemma 9.8.2, 9.8.3 and by repeating steps 1 to 
3 as follows: 

We apply Lemma 9.8.2 to (9.8.9) to obtain 

I| 2 i llz/pt,*! IP < ce^ d(i_il) m 2 (ti) + c||0 T w tjtl || 2 + cA^ x> \\u Pt ti || 2 + cd% 

where we use the fact that ||6* T W(||25 0 , || Z/p* || 25 0 ? Il u pi||2<5 0 A emit). Therefore, the 
fictitious signal m 2 ± = e~ s ^ t ~ tl ' > m 2 {t\) + ||u Pt || 2 + \\y Putl || 2 satisfies 

m 2 i < ce~ 5 ^ t ~ tl ' > m 2 {t\) + c||0 T w tjtl || 2 + cA^m^ + cd 2 Vi > t\ > 0 
Following the same procedure as in step 2 and using Lemma 9.8.2, 9.8.3 we obtain 

||0 T w|| 2 < ce"' 5(t '" tl) m 2 (ti) + c\\(gm fl ) tM \\ 2 + C (J^ + a o^ ^"4 + cd l 

where g is as defined before. Therefore, 

m 2 fi < ce-^-^m 2 ^) + c\\{gm. h ) tM \\ 2 + c(\ + a 2 0 k \ + cd 2 0 

\ a o / 

Using (9.8.30), we obtain 

m 2 fl < ce _A(i_tl) m 2 (ti) + clK^m/J^tJI 2 + edg, Vi > i 1 


or 


(t) < c + ce ^ tl ^m 2 (ti) + c f 

Jt i 

Applying the B-G Lemma III, we obtain 

rrtf^t) < + m 2 (ti))e°^' t i ® ^ dr 


e- 5(t - ) 5 2 (r)m 2 i (r)d T 
+ c<5 f* e^-^e 0 f! §2(T)dr ds 

Jt i 


Vi > t-| > 0. Because g G <S(/o + A 2 + ^), it follows as before that for c/o + cA? < 
<5/2, we have 


m 2 (i) < < ce <5 - /2 ^ t 4l /l + m 2 (t 1 ))e C ^ t i d °^ m dT 

+CS f e-U2(t-s) e cr 3 d 2 o/mRr)dr ds ( 9 . 8 . 33 ) 

Jt! 

Vi > ti, where the inequality m 2 (i) < follows from the definition of m y, . If we 
establish that m G £<*,, then it will follow from Lemma 9.8.1 that all signals are 
bounded. The boundedness of m is established by contradiction as follows: Let us 
assume that m 2 (f) grows unbounded. Because 0 G £oo, it follows that 

m 2 (i) < e fcl(t - to) m 2 (t 0 ) 
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for some fci > 0, i.e., m 2 (t) cannot grow faster than an exponential. As m 2 (t) grows 
unbounded, we can find a to > d > 0 and a f 2 > to with d > f 2 — to such that 
m 2 (t 2 ) > ae kia for some large constant a > 0. We have 

de fcia < m 2 (f 2 ) < e kl ^- to) m 2 (t 0 ) 


which implies that 

In m 2 (to) > In d + k\ [d — (t 2 — to)] 

Because a > t 2 — to, it follows that lnm 2 (to) > lnd, i.e., m 2 (to) > d for t 0 G 
(t 2 - d,t 2 ). 

Let ti = sup r<(2 {arg(?n 2 (r) = a)}. Since m 2 (to) > d for all to G (t 2 — a, t 2 ), 
it follows that ti < t 2 — a and ?n 2 (t) > d Vt G [ti, t 2 ) and t 2 — 1 1 > d. We now 
consider (9.8.33) with ti as defined above and t = t 2 . We have 

m 2 (t 2 ) < c(l + d)e" /3(t2 - tl) +cS f 2 e- 0{t ~ s) ds 

Jt i 

where (3 = | — For large d, we have /3 = | ^ > 0 and 

m 2 (t 2 ) < c(l + a)e~ 0a + ^ 

Hence, for sufficiently large d, we have m 2 (t 2 ) < c < d, which contradicts the 
hypothesis that ?Ti 2 (t 2 ) > de fcl “ > d. Therefore, m G £oo, which, together with 
Lemma 9.8.1, implies that all signals are bounded. 

Step 4. Establish bounds for the tracking error. In this step, we establish 
bounds for the tracking error ei by relating it with signals that are guaranteed by 
the adaptive law to be of the order of the modeling error in m.s.s. We consider each 
adaptive law separately. 

Robust Adaptive Law of Table 9.2 Consider the tracking error equation 

ei = W m (s)p*6 T u + T] v 

We have 

Mt)l< l|W ro ( a )|| 24 ||Fa;|||p*| + | % | 

Therefore, 

M*)| < c\\9 t u\\ +c\g v \ 

Using (9.8.21) and m/ G Too in the above equation, we obtain 

|ei(t )| 2 < c||^|| 2 + c (— + «o A oo) + cdl + c\r ] y \ 2 
yao / 
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We can also establish that 


\Vy(t) I 2 < cAj + cdl 


Therefore, 


|ei(t)| < c\\g\\ 2 + c + ao'Aoo J + cdfi + cA 2 


where g G S(f 0 + Ag 2 + Tq). Using Corollary 3.3.3, we can establish that 
S(fo + Aq 2 + do) and, therefore, 


pt-\-T 

J |ei| 2 dT < c(/ 0 + A 2 + dg)T + c 


where A = Y + + A% + A' 2 2 . 

Robust Adaptive Law of Table 9.3 It follows from the equation of the estimation 
error that 

e i — p£ + em2 

Because e, £, em G <S(/o+A|+dg) and p, m G T oo, it follows that ei G <S(/o+A 2 +dg). 

Robust Adaptive Law of Table 9.4 Substituting (9.8.27) in the tracking error 
equation 

ei = —W m (s)9 T uj + r]y 
c o 

and using rj = CqP v , we obtain 

ei = — (—em 2 + W c (Wb(v T )0) 
c 0 V J 

Using to G Too, 0 G S(fo + A 2 + dg) , we have that the signal £ = W c (Wbiv r )0 G 
S(fo + A 2 + dg). Since p~,m G Too and em G <S(/o + A 2 + dg), it follows that 
e i € 5(/ 0 + A 2 + dg). 

Step 5: Establish parameter and tracking error convergence. As in the ideal 
case, we first show that <j), cj) p is PE if r is dominantly rich. From the definition of 
</>, 4> p , we have 


W (s^u 

A (s) u P vy mys) A ..a p 

0 = H(s) A U) y P , <t> p = Wm ( s )~K U) y P 

Up W m {s)y p 

r J [ Up . 

where H(s) = L 0 (s ) for the adaptive law of Table 9.2 and H(s ) = W m {s) for the 
adaptive law of Table 9.3. Using y p = W m (s)r+e i, u p = Gg 1 {s)(W m (s)r+ei) — rji, 
rj\ = A m (s)(u p + d u ) + d u , we can write 


I'm i Yei Yp — Ypm Ype 
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where 


<t>m = H (s) 


r, cj) prn = W m {s) 


W m (s) 

1 

a(s) ,0-1/ \ 

A(s) ( - t O 

4>e = H(s) A(s) e 1 — H(s) 


^G^(s)W m (s) 

w m (s ) 

1 


Wmis^G^is) 

W m (s)^ 
w m {s ) 

1 




Because we have established that ei £ S( A 2 + + fo) and u p £ £oo, we conclude 

that (/> e , <f>p e € <S(A 2 + Oq + fo). In Chapter 8, we have proved that 4>m,<t>pm are 
PE with level a 0 > 0(A 2 + dg) provided that r is dominantly rich and Z p . R p are 
coprime, i.e., there exist To > 0,T p o > 0, ao > 0, a p o > 0 such that 

1 rt+T 0 rt+T p 0 

Tfr I 0m(r)^(r)dr > a 0 I, — ^ P m(r)^p m (r)dT > a p0 I 

J 0 Jt J pO Jt 


\/t > 0. Note that 


-j /»t+nTo -| rt 

W 0 J t ^ (r) ^ (r)dT * 2*Zb/ 




-j /^t+nTo 

Tf / 0e(r)^(r)dr 

nT 0 J t 

> ^7- (c(A 2 + d 2 + / 0 ) + ^/ 

where n is any positive integer. If we choose n to satisfy -^r < then for 
c(A 2 + do + fo) < if, we have 


i rt+nT 0 

— J ' <Kr)<£(r)ir > 


which implies that <f> is PE. Similarly, we can establish the PE property for (j> p . 

Using the results of Chapters 4 and 8, we can establish that when </>, or (f> p is 
PE, the robust adaptive laws guarantee that 9 converges to a residual set whose 
size is of the order of the modeling error, i.e., 9 satisfies 

|6*(£)| < c(/ 0 + A + do) + r g(t) 


(9.8.34) 
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where rs(t) — > 0 as t — > oo. Furthermore, for the robust adaptive law of Table 9.4 
based on the linear parametric model, we have rsft) — > 0 exponentially fast. Now 

ei{t) = \w m (s)6 r uj + r) y (9.8.35) 

c o 

where r) y = A ~ c »a a W m (d u + A m (s)(u p + d u )). Because u G and \r] y \ < c(A 2 + 
do), we can conclude from (9.8.34) and (9.8.35) that 

|ei(t)| < c(/ 0 + A + d 0 ) + cr e (t) (9.8.36) 

where r e (t) = fg h m (t — r)rg(r)dr and h m (t) = C~ 1 {W m (s)}. Therefore, we have 
r e (t) — > 0 as t — > oo. Furthermore, when rsft) converges to zero exponentially 
fast (i.e., the adaptive law of Table 9.4 is used), r e (t) — > 0 exponentially fast. 
Combining (9.8.34) and (9.8.36), the parameter error and tracking error convergence 
to S follows. 


9.9 Stability Proofs of Robust APPC Schemes 

9.9.1 Proof of Theorem 9.5.2 

The proof is completed by following the same steps as in the ideal case and Example 

9.5.1. 


Step 1. Express u p , y p in terms of the estimation error. Following exactly the 
same steps as in the ideal case given in Section 7.7.1, we can show that the input 
u p and output y p satisfy the same equations as (7.4.24), that is 

x = A(t)x + bi(t)em 2 + b? y m 
y p = Cjx + diem 2 + d 2 ym (9.9.1) 

u p = Cj x + d 3 e?n 2 + dFy m 


where x, Aft), bi(t), b 2 and Cj, * = 1,2, d*,, k = 1, 2, 3,4 are as defined in Section 7.7.1 
and y rn = P-^-^y m . As illustrated in Example 9.5.1, the modeling error terms due 
to A m (s),d u do not appear explicitly in (9.9.1). Their effect, however, is manifested 
in e, em, ||A(<)|| where instead of belonging to C 2 as in the ideal case, they belong 
to S(^s + f 0 ). Because ^ < A^ + ^, we have e, em., ||A(t)|| G <S(A| + ^ + / 0 ). 

Step 2. Establish the e.s. property of Aft). As in the ideal case, the APPC law 
guarantees that detfsl — Aft)) = A*(s) for each time t where A*(s) is Hurwitz. If 
we apply Theorem 3.4.11 (b) to the homogeneous part of (9.9.1), we can establish 
that Aft) is u.a.s which is equivalent to e.s. provided 




m 2 


dr) < fT 


c(/o + A + — 


(9.9.2) 
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Vt > 0, any T > 0 and some n* > 0 where c > 0 is a finite constant. Because 
m 2 > 0, condition (9.9.2) may not be satisfied for small A 2 , /o unless do, the upper 
bound for the input disturbance, is zero or sufficiently small. As in the MRAC 
case, we can deal with the disturbance in two different ways. One way is to modify 
m 2 = 1 + m s to m 2 = 1 + (3o + m s where /3q is chosen to be large enough so that 
< c|^ < say, so that for 

c(/o + A 2 ) < 

condition (9.9.2) is always satisfied and A(t) is e.s. The other way is to keep the same 
in 2 and establish that when m 2 grows large over an interval of time I\ = [t i , 0] , 
say, the state transition matrix <f>(t,T) of A[t) satisfies ||<&(t, t)|| < kie~ k2 ^~0 
Vt > t and t,r £ I\. This property of A(t) (when m 2 grows large) is used in 
Step 3 to contradict the hypothesis that m 2 could grow unbounded and conclude 
boundedness. 

Let us start by assuming that m 2 grows unbounded. Because all the elements 
of the state x are the outputs of strictly proper transfer functions with the same 
poles as the roots of A(s) and inputs u p , y p (see Section 7.7.1) and the roots of A(s) 
are located in Re[s] < — So/2, it follows from Lemma 3.3.2 that £ Coo ■ Because 
y rn , em £ C a Q , it follows from (9.9.1) that ^ £ C^. Because u 2 , y 2 are bounded 
from above by m 2 , it follows from the equation for to 2 that in 2 cannot grow faster 
than an exponential, i.e., m 2 {t) < e kl< ' t ~ t °' > m 2 {to)Nt > to > 0 for some k > 0. 
Because m 2 (t) is assumed to grow unbounded, we can find a to > ot > 0 for any 
arbitrary constant a > f 2 — to such that m 2 (f 2 ) > ae kia . We have 

de fcl “ < m 2 (< 2 ) < e fel(t2 - to) m 2 (< 0 ) 


which implies that 

In to 2 (to) > In a + k\ [a — (t 2 — to)] 

Because a > t 2 — to and to £ (t 2 — a, t 2 ), it follows that 

m 2 (t 0 ) > a, Vt o £ ( t 2 — d,i 2 ) 

Let ti = sup T<t2 {arg(m 2 (t) = a)}. Then, ?n 2 (ti) = a and m 2 (t ) > a,Vt £ [ti , t 2 ) 
where t\ < t 2 — a, i.e., t 2 — t\ > a. Let us now consider the behavior of the 
homogeneous part of (9.9.1), i.e., 

Y = A(t)Y (9.9.3) 

over the interval I\ = [ti, t 2 ) for which ?n 2 (t) > a and t 2 — ti > a where a > 0 is an 
arbitrary constant. Because clet(s/ — A(t)) = A*(s), i.e., A(t) is a pointwise stable 
matrix, the Lyapunov equation 

A r (t)P(t) + P(t)A(t) = -I 


(9.9.4) 
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has the solution P(t) = P T (t) > 0 for each t € I\. If we consider the Lyapunov 
function 

V(t) = Y T (t)P(t)Y(t) 

then along the trajectory of (9.9.3), we have 

v = -y t y + y t Py < - y t y + \\P(t)\\Y T Y 

As in the proof of Theorem 3.4.11, we can use (9.9.4) and the boundedness of P, A 
to establish that ||P(f)|| < c||A(f)||. Because Ai Y T Y < V < X 2 Y T Y for some 
0 < Ai < A 2 , it follows that 

V < ^ - cX^\\A(t)\\)V 

i.e., 

V(t) < e _ /r (A2 1_cA i 

V t > t > 0. For the interval I\ = [ti,t 2 ), we have m 2 (t) > a and, therefore, 

[ l|A(r) \\dr < c(Al + f 0 + —)(t - r) + c 
Jr a 

and therefore, 

V(t) < e- x ^- T W{T), Vt,r G [ti,t 2 ) (9.9.5) 

and t > t provided 

c(/o + Ao + ~) < Ao (9.9.6) 

a 

A -1 

where Ao = — |— . From (9.9.5) we have 

Ai Y T (t)Y(t) < Y T (t)PY{t) < e - A o(t- T ) A 2 F T ( T )y(r) 
which implies that 

TO I < \[^e~ x °^\Y(T)\, Vi,r G [h ,i 2 ) 

V M 

which, in turn, implies that the transition matrix $(t, r) of (9.9.3) satisfies 

||$(*,t)H < /3 0 e _a ° (t_r) , Vt , t £ lh,t 2 ) (9.9.7) 

where /3q = o = 4p. Condition (9.9.6) can now be satisfied by choosing 

a large enough and by requiring A 2 , /o to be smaller than some constant, i.e., 
A -1 

c(/o + A|) < j ,, say. In the next step we use (9.9.7) and continue our argument 
over the interval I\ in order to establish boundedness by contradiction. 
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Step 3. Boundedness using the B-G Lemma and contradiction. Let us apply 
Lemma 3.3.6 to (9.9.1) for t € [ti , ^ 2 ) - We have 

^ ce~ s/2 ^'~ tl ' , \x(ti)\ +c||(em 2 ) Ml || +c 

where ||(-) t)tl || denotes the £26 norm || (•)*,*! II 2<5 defined over the interval \t\,t), for 
any 0 < <5 < <5i < 2a 0 - Because ^ it follows that 

|| < ce _A/2(t_tl) m(ti) + c|| (em 2 ) Ml || + c 
Because ||y pt)tl ||, ||up t)tl || < c||a; Ml || + c||(em 2 ) t , tl || + c, it follows that 

\\y P t, tl \l IKt.tJ < ce~ A/2(t_4l) m(fi) + c\\(em 2 ) t , tl \\ + c 

Now m 2 (t) = 1 + m s {t) and 

m. s (t) = e~ So( ' t ~ tl ^m s (ti) + \\y ptM \\l 5o + \\u ptM \\ 2 So 
Because || (•)*,*! II 250 < ll(-)t,*i II for s < ^' 0 , it follows that 

m 2 {t) = 1 + m s (t) < 1 + e~‘ 5o(t_tl) m 2 (ti) + || y pt , tl || 2 + \\u pt ,t 1 || 2 Vt > h 
Substituting for the bound for || 2 / ptitl ||, Hupt.till we obtain 

m 2 {t) < ce~ s< ' t ~ tl \n 2 (ti) + c||(em 2 ) titl || 2 + c Vi > t\ > 0 


or 


~S(t-r) 2 2 2 


(r)dr 


(9.9.8) 


?n 2 (t)<c+ce s (* 4l i?n 2 (ti)+c f 

Jti 

Applying B-G Lemma III we obtain 
m 2 (t) < c(l + ?n 2 (ti))e -<5 d- t i) e c /t 1 e m dT + c $ f e - s ( t ~ s ) e c f g e m Vt > t\ 

Jt 1 

For t, s g [ti, t 2 ) we have 

c J e 2 m 2 dr < c ^A^ + fo + (t — s) + c 

d 2 Sj 

By choosing a large enough so that c-A < | and by requiring 


c(A 2 + /o)<- 


we have 


2 (t 2 ) < c(l + m 2 (ti))e 4l i+c(5 f e S )<is 

Jt 1 

< c(l + m 2 (ti))e"3( t2 - tl ) +c 
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Because <2 — h > ot,m 2 (ti) = a and m 2 (t 2) > a , we have 

d < m 2 (t 2 ) < c(l + d)e _ ~ + c 

Therefore, we can choose d large enough so that m 2 (t 2 ) < d which contradicts the 
hypothesis that to 2 (< 2 ) > d. Therefore, m € Coo which implies that x,u p ,y p £ Coo- 
The condition for robust stability is, therefore, 

c(/ 0 + A 2 ) < min{ = 5* 
for some finite constant c > 0. 

Step 4. Establish bounds for the tracking error. A bound for the tracking 
error ei is obtained by expressing e\ in terms of signals that are guaranteed by the 
adaptive law to be of the order of the modeling error in m.s.s. The tracking error 
equation has exactly the same form as in the ideal case in Section 7.7.1 and is given 
by 

A (s)s n ~ 1 Q m (s) 2 A(s)oA_ 2 (s) 

61 = FW ™ + A-fs) 

(see equation (7.7.21)) where Vq is the output of proper stable transfer functions 
whose inputs are elements of 9 P multiplied by bounded signals. Because 9 p ,em 2 £ 
5(^2 + /o) aR d ^2 < c(A| + dg), due to m £ C^, it follows from Corollary 3.3.3 
that ei £ S( A 2 + dg + fo) and the proof is complete. 


9.9.2 Proof of Theorem 9.5.3 

We use the same steps as in the proof of Example 9.5.2. 

Step 1. Develop the state error equations for the closed-loop plant. We start 
with the plant equation 


RpUp — Zp( 1 A A m )(u p + d u ) 


Operating with on each side, we obtain 


11 


Qrr 

Ql 


Q 


Qr 


Up = Rp-^-ei = Z p u p + Z p ---[A m (u p + d u ) + d u \ 


i.e., 


Q 


Z p Q 1 

ei = — „ — u. 


ZpQrn 


Rp Q m Rp Q m 


Qi 


\Am(u p A du) A du\ 


Because is strictly proper, we can find an arbitrary polynomial A(s) whose 

roots are in Re[s] < —S 0 / 2 and has the same degree as R p , i.e., n and express e\ as 
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where 

% 

T] = ^ [X m (u p “b d u ) + d u ] 

We express (9.9.9) in the following canonical state-space form 

e = Ae + + -Bi?? 

ei = C' T e + dir7 

where C r (sI-A)- 1 B = ^^,C T (si ~ A)~ l B x + d i = with 


A = 




*-n-\-q— 1 


o 


B = 9* 2 , C t = [1, 0, . . . , 0] 


(9.9.10) 


and 91, 9 2 are defined as R p Q m = s n+q + 0J T a n+(? _i(s), Z p Qi = 9 2 T a n+q _ i. 

It follows that e 0 = e — e is the state observation error. From the equation of e 
in Table 9.6 and (9.9.10), and the control law u p = — K c (t)e , we obtain 

'e = A c (t)e + k 0 C T e 0 + K 0 d 1 ri (9.9.11) 

e 0 = A 0 e 0 + 9ie\—9 2 u p + B- 2 i) (9.9.12) 

where B 2 = B\ — 9\d\ — K 0 (t)d\ and 


A 0 = 

* 

—a 

In-\-q— 1 



0 


whose eigenvalues are equal to the roots of A*(s) and therefore are located in 

Re[s] < - V 2 ; A c (t) = A - BI< C and 9 X = 9 1 - 9^ 9 2 = 9 2 - 9%. The plant 
output satisfies 

y p = C T e 0 + C T e + y m + d±ij (9.9.13) 

As shown in Section 7.3.3, the polynomials Z pi Q\, R p ,Q m satisfy the equation 
(7.3.25), i.e., 

RpQ m X + Z P Q{Y = A* (9.9.14) 

where X , Y have degree n + q— 1, X is monic and A* is an arbitrary monic Hurwitz 
polynomial of degree 2 (n + q) — 1 that contains the common zeros of Q\, R p Q m . 
Without loss of generality, we require A*(s) to have all its roots in Re[s] < — Ao/2. 
From (9.9.14) and Q m u p = Q\u p , it follows that 

R p XQi _ QxYZp 
u p = A * u p + u p 
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Because R p y p = Z p u p + Z p [A m (u p + d u ) + d u ], we have 


R P XQ 1 _ 
Up = (a* u p 


QiYR p 

A * yp - 


QiYA 

—^9 


(9.9.15) 


by using the definition of A, y given earlier. 

Equations (9.9.11) to (9.9.15) together with u p = —K c (t)e describe the stability 
properties of the closed-loop APPC scheme of Table 9.6. 


Step 2. Establish stability of the homogeneous part of (9.9.11) and (9.9.12). 
The stability of the homogeneous part of (9.9.12) is implied by the stability of 
the matrix A a whose eigenvalues are the same as the roots of A*(s). Because 
det(sJ — A c (t)) = A*(s), we have that A c (t) is pointwise stable. As we showed in 
the ideal case in Section 7.7.2, the assumption that Z p ,Q\R p are strongly coprime 
implies that (A, B) is a stabilizable pair [95] in the strong sense which can be used to 

show that K c , K c are bounded provided 61,62,61,62 £ Coo- Because the coefficients 
of Z p (s , t),R p (s, t) generated by the adaptive law are bounded and their derivatives 

belong to £ <S(^ + /o), it follows that |6>i|, \6i\, | K c \, \K C \ £ Coo and \6i\,\k c \ £ 
5(^t +/o) for i = 1, 2. Because A c = A—BK C , it follows that ||A c (t)||, || A c (f)|| £ Coo 
and ||A c (f)|| £ S(rf /m 2 + f 0 ). Because ^ < A 2 + / 0 * dr where d 0 is an upper 
bound for |d„|, the stability of A c (t) cannot be established using Theorem 3.4.11 (b) 
unless — is assumed to be small. The term do/m can be made small by modifying 
m to be greater than a large constant /3q, say, all the time as discussed in the proof 
of Theorem 9.5.2. In this proof we keep the same m as in Table 9.6 and consider 
the behavior of the state transition matrix < E’(t,r) of A c (t) over a particular finite 
interval where m(t) has grown to be greater than an arbitrary constant a > 0. The 
properties of 4>(f, r) are used in Step 3 to contradict the hypothesis that m 2 can 
grow unbounded and conclude boundedness. 

Let us start by showing that m cannot grow faster than an exponential due to 
the boundedness of the estimated parameters A, B. Because (C,A) in (9.9.10) is 
observable, it follows from Lemma 3.3.4 that 


|e(t)|, ||e t || 2 5 0 < c||upt|| 2< 5 0 + c||r? t || 2 5 0 + c||j/ pt || 25o + c 

Because u p = of u p , it follows from Lemma 3.3.2 that ||u P (|| 2( 5 0 < cm which together 
with ||? 7 t|| 2( 5 0 < A 2 ?n imply that e/m £ Coo- Similarly, applying Lemma 3.3.3 to 
(9.9.12) and using 6, £ Coo, we can establish that e 0 /m £ Coo which implies that 
e = e — e 0 is bounded from above by m. From (9.9.13), (9.9.15), it follows that 
u p /m,y p /m £ Coo which together with the equation for m 2 imply that m 2 (t) < 
e fci(t-t 0 ) m (to) Vt > <0 > 0 for some constant k\. 

We assume that m 2 (t) grows unbounded and proceed exactly the same way as 
in Step 2 of the proof of Theorem 9.5.2 in Section 9.9.1 to show that for 

C (A 2 + fo) < A 
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and some constant A > 0 that depends on the pointwise stability of A c (t), we have 
||^(i,r)|| < A 0 e -ao( *~ r) Vt,r G [ti,t 2 ) 

where t 2 — t\ > a,m 2 [t\) = a,m 2 (t 2 ) > de fcl “ > a, m 2 {t) > a Vi G [ti, t 2 ) and a 
is large enough so that do /a < c A. 

Step 3. Boundedness using the B-G Lemma and contradiction. Let us apply 
Lemma 3.3.6 to (9.9.11) for t G [ii,i 2 ). We have 

||e Ml ll < ce- | (‘- tl) |e(t 1 )| + c||(C T e 0 ) t , tl || + c|| % , tl || 

where ||(-)t,iil| denotes the £ 2 < 5 -norm ||(-)t,ti || 2 j for some 5 > 0. Because e/m G £oo, 
we have 

||et, tl || < ce~^ (t ^ tl ' > m(ti) + c\\(C T e 0 ) tttl \\ + cWgtuA (9.9.16) 

From (9.9.13), we have 

IbpMill ^ c||(C T e 0 ) Ml || + c||et jtl || + c||? 7 Ml || + c (9.9.17) 

Applying Lemma 3.3.6 to (9.9.15) and noting that the states of any minimal state 
representation of the transfer functions in (9.9.15) are bounded from above by m 
due to the location of the roots of A*(s) in Re [,s] < — (5o/2 and using u p = —K c e, 
we obtain 

IKmiII < c||e Ml || +c\\y ptM \\ +c\\r)t ttl \\ + ce~i (t ~ tl) m{ti) (9.9.18) 

Combining (9.9.16), (9.9.17) and (9.9.18) we have 

m 2 fl (t) = e~ 5o ^ t ~ tl) rn 2 (t 1 ) + \\u ptM \\ 2 + \\ypt,tA 2 (9.9.19) 

< ce'‘ 5(t_tl) m 2 (ti) + c||(C T e 0 ) t>tl || 2 + c||? 7 t , tl ||+c,Vt G [ii,i 2 ) 

Now from (9.9.12) we have 

C T e 0 = C T (sI-A 0 )- 1 (e 1 e 1 -e 2 u p ) + C T (sI-A 0 )- 1 B 2 r ] 

a n+q-l(s) ~ _ . a n + q -i (s) . . 

= A-M e ‘ u > )+ A-M B2V <9 ' 9 ' 20) 

Following exactly the same procedure as in Step 3 of the proof of Theorem 7.4.2 in 
Section 7.7.2 in dealing with the first term of (9.9.20), we obtain 

C T e 0 = Af,( * )< ?™ (a) em 2 + r 2 + W(s) V (9.9.21) 

where r 2 consists of terms of the form 1 Fi(s)w t 0 p with u/m G and fFi(s), W(s) 
being proper and analytic in Re[s] > — <5 q/ 2. Applying Lemma 3.3.5 to (9.9.21) 



9.9. STABILITY PROOFS OF ROBUST APPC SCHEMES 


767 


and noting that any state of a minimal state-space representation of the transfer 
functions in (9.9.21) is bounded from above by m, we obtain 

||(C' T eo)*,i 1 || < ce _ s (t_tl) m(ti) + c||(em 2 ) Ml || + c||(0 p m) t)tl || + c||% )tl || (9.9.22) 
Using (9.9.22) in (9.9.19), and noting that ||% )tl || < A 00 nif 1 , we obtain 

fn 2 h {t) < ce~ 5{t ^ tl \ 1 + ?n 2 (ti)) + c||(<p7i) t , tl || 2 + cA^m/^t), \/t £ [fi,f 2 ) 
where g 2 = e 2 m 2 + \0 p \ 2 . Therefore, for 

cA^ < 1 (9.9.23) 


we have m 2 (f) < rrP^it) and 


m 2 (t)<ce *^(1 + m 2 (ti)) + c f g 2 {r)m 2 {T)dT , Vte[ii,i 2 ) (9.9.24) 

Jti 

Applying B-G Lemma III we obtain 

m 2 (<) < c(l + m 2 (fi))e _ ' 5( - t_ * 1 ^e h ** ^ + c6 f e~ s ^~ s ^e c ^ ® ^ dT ds 

• 't i 


Because ?n 2 (t) > a Wt £ [ti,t 2 ), we have c f* g 2 (r) dr < c(A| + fo + — s) + 

c,Vt, s € [ti,t 2 ). Choosing d large enough so that < |, and restricting A 2 ,/o 
to satisfy 

c(A 2 + / 0 ) < - (9.9.25) 

we obtain 

m 2 (t 2 ) < c(l + m 2 (ti))e~i l ' t2 ~ tl ' > + c 

Hence, 

d < m 2 (t 2 ) < c(l + d)e - ^ + c 

which implies that for d large, m 2 (t 2 ) < d which contradicts the hypothesis that 
m 2 (t 2 ) > d. Hence, m £ Coo which implies that e, e, e D , y p , u p £ Coo- 

Using (9.9.23), (9.9.25) and c(A| + /o) < A in Step 2, we can define the constants 
A ^ , S* > 0 stated in the theorem. 


Step 4. Tracking error bounds. The tracking error equation is given by 

ei = C T e 0 + C T e + dig 


We can verify that the £ 2 e-norm of e and C T e 0 satisfy 


Pt|| 2 e < c||(C' T e 0 ) t || 2 e + c||r?t||2e 

||(C T e 0 )t||2e < c\\em.t\\ 2 e + c\\d pt \\ 2 e + c\\r]the 
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by following the same approach as in the previous steps and using 5 = 0, which 
imply that 


IMIL 


A 



< c f (e 2 m 2 + \0. p \ 2 + rj 2 )dT 

Jo 

< c( A2 + fo + dl)t + c 


and the proof is complete. 


□ 


9.10 Problems 


9.1 Show that if u = simo^t is dominantly rich, i.e. , 0 < ujq < 0(l//x) in Example 

9.2.3, then the signal vector <j> is PE with level of excitation a 0 > 0(/i). 

9.2 Consider the nonminimum-phase plant 

y p = G(s)u p , G(s) = s2 + a ^ + b 


where /i > 0 is a small number. 


(a) Express the plant in the form of equation (9.3.1) so that the nominal 

part of the plant is minimum-phase and the unmodeled part is small for 
small /i. 

(b) Design a robust MR.AC scheme with a reference model chosen as 


W m {s) 


1 

s 2 + 1.4s + 1 


based on the nominal part of the plant 

(c) Simulate the closed-loop MR.AC system with a = —3, b = 2 and r =step 
function for /x = 0, 0.01,0.05, 0.1, respectively. Comment on your simu- 
lation results. 


9.3 In Problem 9.2, the relative degree of the overall plant transfer function is 

n* = 1 when /x 7^ 0. Assume that the reference model is chosen to be 
W m i(s) = . Discuss the consequences of designing an MR AC scheme 

for the full order plant using W m i(s) as the reference model. Simulate the 
closed-loop scheme using the values given in part (c) of Problem 9.2. 

9.4 For the MR AC problem given in Problem 9.2, 

(a) Choose one reference input signal r that is sufficiently rich but not dom- 
inantly rich and one that is dominantly rich. 
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(b) Simulate the MR.AC scheme developed in Problem 9.2 using the input 
signals designed in (a). 

(c) Compare the simulation results with those obtained in Problem 9.2. 

Comment on your observations. 


9.5 Consider the plant 


Up = 


1 

s{s + a) 


where a is an unknown constant and 


Vm ~ (a +3) 2r 


is the reference model. 

(a) Design a modified MR.AC scheme using Method 1 given in Section 9.4.2 

(b) Simulate the modified MR.AC scheme for different values of the design 
parameter r. 

9.6 Consider Problem 9.2. 

(a) Replace the standard robust MR.AC scheme with a modified one from 

Section 9.4. Simulate the modified MR.AC scheme using the same pa- 
rameters as in Problem 9.2 (c) and r = 0.1. 

(b) For a fixed /i (for example, fj, = 0.01), simulate the closed MR.AC scheme 
for different r. 

(c) Comment on your results and observations. 

9.7 Consider the speed control problem described in Problem 6.2 of Chapter 6. 

Suppose the system dynamics are described by 

V = — ■ — (1 + A m (s))0 + d 
s + a 

where d is a bounded disturbance and A m represents the unmodeled dynam- 
ics, and the reference model 


V m 


0.5 

s + 0.5 


V. 


is as described in Problem 6.2. 

(a) Design a robust MR.AC scheme with and without normalization 

(b) Simulate the two schemes for a = 0.02 sin O.Olf, b = 1.3, A m (s) = — 
and d = 0 for fi = 0, 0.01, 0.2. Comment on your simulation results. 
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9.8 Consider the plant 

2/ = u - yA a (s)u 

s — a 

where y > 0 is a small parameter, a is unknown and A a (s) is a strictly proper 
stable unknown transfer function perturbation independent of y. The control 
objective is to choose u so that all signals are bounded and y tracks, as close 
as possible, the output y rn of the reference model 

1 

V m = — — rr 
s + 1 

for any bounded reference input r as close as possible. 

(a) Design a robust MRAC to meet the control objective. 

(b) Develop bounds for robust stability. 

(c) Develop a bound for the tracking error e\ = y — y m . 

Repeat (a), (b), (c) for the plant 


V = u 

s — a 

where r > 0 is a small constant. 

9.9 Consider the following expressions for the plant 

2/p = G 0 (s)u p + A a (s)u p 


y P = 


N 0 (s) + Aj (s) 


(9.10.1) 

(9.10.2) 


D 0 (s) + A 2 ( s ) p 

where A Q , Ai, A 2 are plant perturbations as defined in Section 8.2 of Chapter 


( a ) 


Design a model reference controller based on the dominant part of the 
plant given by 

2/p = G 0 (s)u p (9.10.3) 


where Gq(s) 
P4 and 


TTTsj = fcp fl’(g) satisfies the MR.AC Assumptions PI to 


2/m = W m (s)r 


Z m (s ) 

V 

Rm(s') 


is the reference model that satisfies Assumptions Ml and M2 given in 
Section 6.3 of Chapter 6. 

(b) Apply the MRC law designed using (9.10.3) to the full-order plants given 
by (9.10.1), (9.10.2) and obtain bounds for robust stability. 
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(c) Obtain bounds for the tracking error e\ = y p — ?/„ 
9.10 Consider the plant 


where 


Up = G 0 (s)u p + A a (s)u p 


y p = G 0 {s)u p 


is the plant model and A a (s) is an unknown additive perturbation. Consider 
the PPC laws given by (9.5.3), (9.5.5), and (9.5.7) designed based on the 
plant model but applied to the plant with A a (s) ^ 0. Obtain a bound for 
A„(s) for robust stability. 


9.11 Consider the plant 


V P = 


No + Ai _ 
Dn + Aq 


where G 0 = is the modeled part and A 1; A 2 are stable factor perturba- 
tions. Apply the PPC laws of Problem 9.10 to the above plant and obtain 
bounds for robust stability. 

9.12 Consider the robust APPC scheme of Example 9.5.1 given by (9.5.18), (9.5.19) 
designed for the plant model 


Vp = 


but applied to the following plants 


(i) V P = — : — u p + A a (s)i 
s + a 

r .N i+A7 +Al ( s ) 


where A a (s) is an additive perturbation and Ai(s),A 2 (s) are stable factor 
perturbations and Ao > 0. 

(a) Obtain bounds and conditions for A 0 (s), Ai(s), A 2 (s) for robust stabil- 

ity. 

(b) Obtain a bound for the mean square value of the tracking error. 

(c) Simulate the APPC scheme for the plant (i) when b = 2(1 + 0. 5 sin O.Olf), 

a = — 2sin0.002f, A a (s) = — for /.t = 0,0.1, 0.5, 1. 

9.13 Consider the robust APPC scheme based on state feedback of Example 9.5.2 
designed for the plant model 


Vp = 
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but applied to the following plants: 

(i) y P = ~^ u p + N a (s)u p 


(ii) 


Vp = 


b 

S + Ap 

S + Aq 


Ai(s) 

A 2 (s) 


Up 


where A a (s) is an additive perturbation and A 1 (s),A 2 (s) are stable factor 
perturbations and Aq > 0. 


(a) Obtain bounds and conditions for A a (s), Ai(s), A 2 (s) for robust stabil- 
ity. 

(b) Obtain a bound for the mean square value of the tracking error. 

(c) Simulate the APPC scheme with plant (ii) when 


(e~ TS - 11 

6 = 1, a = — 2sin0.01i, Ao = 2, AAs) = , A 2 (s) = 0 

s + 2 

for t = 0,0.1, 0.5,1. 

9.14 Simulate the ALQC scheme of Example 9.5.3 for the plant 

y P = jqjq^(l + A m{s))u p 
where A m (s) = and / 1 , > 0. 

(a) For simulation purposes, assume b = — 1, a = —2(1 + 0.02 sin O.lf). Con- 

sider the following values of y: y — 0, y = 0.05, \x = 0.2, y = 0.5. 

(b) Repeat (a) with an adaptive law that employs a dead zone. 

9.15 Consider the following MIMO plant 


Ui 


hii(s) fti 2 (s) 


Ml 

y-2 


0 6-22 (s) 


u 2 


where hu = q ^-,/112 = £iA(s),/i 2 2 = qq^p and A(s) is strictly proper and 
stable. 

(a) Design a decentralized MRAC scheme so that y\ , t/ 2 tracks y m \ , y m i , the 
outputs of the reference model 


Uml 


\ 5TT ° ' 


n 

Um2 


0 — 

L W s+2 J 


. r 2 . 


for any bounded reference input signal ri,r 2 as close as possible, 
(b) Calculate a bound for /zA(s) for robust stability. 
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9.16 Design a MIMO MR.AC scheme using the CGT approach for the plant 


x = Ax + Bu 
y = C T x 


where x € 1Z 2 , u G 1Z 2 , y € 77 2 and ( A , B, C) corresponds to a minimal state 
representation. The reference model is given by 



O 

CM 

1 


' 1 ' 

%m — 

i 

O 

1 

to 

X m + 

1 


Urn — %m 


Simulate the MR.AC scheme when 


A = 


-3 1 

0 0.2 ’ 


B = 


1 0 
0 2 


C = 


1 0 
0 1 



Appendix 


A Swapping Lemmas 

The following lemmas are useful in establishing stability in most of the adap- 
tive control schemes presented in this book: 

Lemma A.l (Swapping Lemma A.l) Let 0,w : TZ + > lZ n and 0 be 

differentiable. Let W{s) be a proper stable rational transfer function with a 
minimal realization (A, B,C,d), i.e., 

W(s) = C T (sI — A)~ 1 B + d 


Then 


where 


W(s)9 t lu = d T W(s)u} + VL c (s) ( (Wft(s)u; 1 )9 


W c (s) = —C t (sL - A)- 1 , W b (s) = ( si - A)~ 1 B 
Proof We have 

W(s)0 t lo = W(s)w T 0 = d0 T w + C T [ e A{t ~ T) Bw T 0dr 

Jo 


= dO T u> + C T e At 


T— 0 


/o JO 
rt 


e Aa B lo t (a)daO(T) 
e~ Aa Blo t (a)da9(T)dT 

y 

dw + C T f e A ( t_<7 ) Bu>(a)da 
Jo 

-C T [ e A(t ~ r) [ e A{ - T - a) B w t {a)da0(T)dr 


(A.l) 
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Noting that 


duj + C r [ e A(t - a) Bu{a)d<j = (d + C T (sI - A)~ 1 B)tv = W(s) 
Jo 

f e A< ' t ~ a ' > B lo t ( a) da = (si — A)~ 1 Buj y 
Jo 


and 


C T [ e Ait ~ T) f(T)dT = C T (sI-A)- 1 f 

Jo 

we can express (A.l) as 

W(s)6 T tv = F W(s)tv - C T (sI - A)- 1 { ((si - A)~ 1 Blo t ) 0} (A. 2) 

and the proof is complete. 


□ 


Other proofs of Lemma A.l can be found in [201, 221]. 

Lemma A. 2 (Swapping Lemma A. 2) Let 6,00 : IZ + 1 — > JZ n and 9, tv be 

differentiable. Then 


Ftv = Fi(s,a 0 ) 


9 tv + Flo 


+ F(s , oto)\fF tv 


(A.3) 


where F(s,a 0 ) = 1 F ( g > a o) = 1 F G’ a A ^ \ anc i aQ > q is an 

arbitrary constant. Furthermore, for ao > 5 where 6 > 0 is any given 
constant, Fi(s,ao) satisfies 

\\Fi(s, ao)]^ <^~ 
ot 0 

for a finite constant c £ IZ + which is independent of a^. 

Proof Let us write 

Ftv = (1 — F(s, a 0 )) Ftv + F(s, a 0 )Fiv 

Note that 

k k k 

k \ ' A _ k—i ^ k , \ ^ s~ii A _ k—i ^ k , _ \ ^ syi A— 1 _ k—i 


(s + a 0 ) fe — ^2 

i = 0 


k s " a b ~ — a'o + C l k s"a'Q " — ajj + s u k s " ~ a b 


i - 1 


i = 1 
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where <7| = (0! = 1), we have 


(s + a 0 ) k -a k 0 _ XhCis*- 1 ^-* 


(s + a 0 ) k S (s + a 0 ) k 


Defining 


we can write 


a Ell Cis*- 1 ^-* 1-F(s,a 0 ) 

fi{s,a 0 ) - 7 , 77 

(s + a 0 J s 


9 w = Fi(s,a 0 )s(6 | w) + F(s, a 0 )(9 u) 

= Fi(s,a o )(0 u> + 9 T u) + F(s, a o )(0 T w) 


(A.4) 


To show that ||Ti(s, cko) IIqo,? < ^ for some constant c, we write 

1 


«0 

& cA — 


f’ 1 ( S ,a„) = E -' C * S, "“' 


0 




x fc — 2 


(s + a 0 ) fe s + (s + a 0 )* 1 (5 + a 0 ) 


\k—i 


Because 


and 


(s + a 0 y 
1 


00(5 


S + OlQ 

we have 

II - f i ( s 5 Q : o )|| oo5 < 


c 

< — 
«o 


^0 

) 

1 _ f 2a 0 

ao 

oo(5/ 

\ 2ao — <5 


2 

s’” 1 

o 0 ’ 

(s + a 0 ) ,_1 


< 2*, i > 1 


= 1, i > 1 


1 

k 

Y.a 

oo<5 i — i 

S 4 " 1 


T 

"Sfi o 

Q 


s + q:q 

(s + a 0 ) i_1 

00(5 

(s + a 0 ) fc * 

00(5 


E. fc ( 27 * 2 fc — *+! 
2 cko — S 


where c = Ei=i Cfc2 fc 1+1 is a constant independent of ag. The above inequalities 
are established by using ao > 5. □ 


In the stability proofs of indirect adaptive schemes, we need to manipu- 
late polynomial operators which have time- varying coefficients. We should 
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note that given a(t) E 7Z n+1 , and a n (s) = [s”, s n 1 , . . . , s, 1] T , A(s, t),Afs, t) 
defined as 

A{s,t ) = a n (t)s n + a n _i(t)s n_1 4 h ai(i)s + a 0 (i) = a T (t)a n (s) 

A(s,t) = s n a n (t ) + s" _1 a n _i(t) 4 hsai(t) + a 0 (t) = aj(s)a(t) 

refer to two different operators because of the time- varying nature of a(t). 
For the same reason, two polynomial operators with time- varying parameters 
are not commutable, i.e., 

A(s, t)B(s , t ) B(s, t)A(s , t), A(s, t)B(s, t ) 5(s, t)A(s, t) 

The following two swapping lemmas can be used to interchange the sequence 
of time-varying polynomial operators that appear in the proof of indirect 
adaptive control schemes. 

Lemma A. 3 ^Swapping Lemma A.3J. Let 

A(s,t ) = a n (t)s n + a n _i(f)s n-1 4 ai(t)s + a 0 (t) =a T (t)a n (s) 

B(s,t ) = b m (t)s m + b m -i(t)s m ~ 1 4 &i(i)s + 6 0 (t) = b T (t)a m (s) 

be any two polynomials with differentiable time-varying coefficients, where 
a = [a n , a n _i, . . . , a 0 ] T , b = [b m , b m -i, . . . , b 0 \ T , a(t ) E K n+l , b(t ) G IZ m+1 
and ai(s) = [s*, s* _1 , • • • , s, 1] . Let 

C(s, t ) = A(s, t ) • B(s, t ) = B(s, t ) • A(s, i) 

be the algebraic product of A(s,t ) and B(s,t). Then for any function f(t) 
such that C(s,t)f , A(s,t)(B(s,t)f), B(s,t)(A(s,t)f) are well defined, we 
have 


(i) A(s,t)(B(s,t)f) = C{s,t)f + a T {t)D n _i(s) a n -i (s)(a^(s)f)b 


Mi 


(A.5) 


and 
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(ii) A(s, t)(B(s , t)f) = B(s, t)(A(s , t)f) + G(s, t ) ^(i7(s)/) 

(A.6) 

where 

G(s,t ) = [a T J D n _i(s),6 T L> m _i(s)] 

Di(s), H(s) are matrices of dimension (i + 2) x (* + 1) and (n + m) x 
(ra+l + m + 1) respectively, defined as 

1 s s 2 • • • s* 

0 1 s • • • 

0 0 ■■■ ■■■ : 

: 1 s 

0 ••• 01 
0 ••• 00 

tt( \ ^ 0 a n _i(s)a m (s) 

-a m _i(s)aj(s) 0 

Proof (i) Using the relation s(/i/ 2 ) = f\sf -2 + (s/i)/ 2 , we can write 

n m 

A(s,t)(B(s,t)f) = a 0 B(s,t)f + E a ‘E s ' (w) 

*=1 j=0 

n m 

= ao-B(M)/ + E ai E s * _1 (bjs^ 1 f + bjs 3 (A.7) 

i=l j=0 
n m 

= a 0 -B(s,t)/ + E a *E [s* -2 (bjS 3+2 f + E J+1 /) +s‘^ 1 b J s :, f 

i— 1 j = 0 

: (A-7) 

n m / i— 1 

= aoB(s, t)f + J2 a > E & E + V + E $k { i, P J i ' k f 

i—1 j — 0 \ k—0 

n m n m i—1 

= E a ‘ E h n ‘ ' 7 + E E E sfc (V +i 1 */) = c(a t)f + r(t) 

i=0 j=0 «— 1 j=0 /c— 0 

n i—1 m 

^) = E a *E sfc E(E j+i - 1 - fe /) 

i=l fc=0 j=0 






where 
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and the variable t in a, b, / is omitted for the sake of simplicity. 

From the expression of r(<), one can verify through simple algebra that 


where 


r(t) = a T D n _i(s)r b (t) 


ET=ob^ +n -\f 

E7=o^ + - 2 / 


(A.8) 


nit) = 


£7=oM j+ 7 

To simplify the expression further, we write 


f 

gU— 1+ra $n— 1+ra— 1 

• s n— 1 ' 

) 


brn 

gU— 2+ra ^n—2-\-m— 1 

• s n ~ 2 


bm— 1 




/ 



V 

s m s™' 1 

1 

J 


- ^>0 . 


nit) = 

= (a„_i(s)oA(s)/) b 


(A.9) 


Using (A.8) and (A.9) in (A. 7), the result of Lemma A. 3 (i) follows immediately, 
(ii) Using the result (i), we have 


i(s) ([a n _i(s)aA(s)/] &) 

(A. 10) 

i(s) ([a m _i(s)aA(s)/] a) 

(A.ll) 


(A- 12) 


Subtracting (A. 11) from (A. 10), we obtain 

Ais,t)iBis,t)f) ~ Bis,t)iAis,t)f) 

= a T L>„_i(s) (J(a n _i(s)a7( s )/] &) - 6 T D m _i(s) ( [(a m _i(s)a^(s)/] a) 

The right-hand side of (A. 12) can be expressed, using block matrix manipulation, 


as 


a r D n _ 1 is),b T D m _ 1 is) 


0 a n -i (s)«7( s ) 

-a m -i is)alis) 0 


and the proof is complete. 


□ 
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Lemma A. 4 (Swapping Lemma A. 4) Let 

A(s,t) = a n (t)s n + a n -i{t)s n ~ 1 H a\(t)s + ao(t) = a T (t)a n (s) 

A(s,t ) = s n a n (t ) + s n_1 a n _i(t) H sa\(t) + ao(t) = a^(s)a(t) 

B(s,t ) = b m {t)s m + H 6i(i)s + b Q (t) = b T (t)a m (s) 

B(s,t ) = s m b m (t ) + s m_1 6 m _i(t) H s&i(t) + 6 0 (t) = oA(s)&(t) 

where s=fjf, a(i) e7£ n+1 , 6(i) G 7£ m+1 and a, b € Coo, a* = [s*,- • • , s, 1] T . Let 

n m n m 

A(s, i) • B(s, t ) = EE a^s*A A(s,t) ■ B(s,t) = EE s i+j aibj 

i=0 j=0 i=0j=0 

be the algebraic product of A(s,t), B(s,t) and A(s,t), B(s,t), respectively. 
Then for any function f for which A(s,t) ■ B(s,t)f and A(s,t) (B(s,t)f) 
and B(s,t) ( A(s,t)f ) are defined, we have 

(i) A(s,t)(B(s,t)f) = B(s,t)(A(s,t)f) + a£(s)F(a,b)a n (s)f (A.13) 

where n = max{n, m} — 1 and F(a,b) satisfies ||F(a,6)|| <ci|a| + C 2 |d| 
for some constants c\,C 2 

(ii)A(s, t ) ( B(s , t) —Ur/) = -rTT^( s ’ 0 ' B (s, *)/ + (s)G(s, f, a, b) 

\ A o(s) / A 0 (s) 

(A.14) 

for any Hurwitz polynomial Ao(s) of order greater or equal to m, where 
G(s , /, a, 6) is defined as 

A m 

G(s, /, a, 6) = [g n , . . • ,5i,5o],5j = -E W 0c( s ) ((^?&( s )/)(aA+ a A)) 

i=o 

and Wj c , Wjb are strictly proper transfer functions that have the same 
poles as 

Proof (i) From the definition of A(s, t),A(s, t),B(s, t),B(s, t), we have 

n m 

A(s, t)(B(s, t)f) = EE^w) 

2—0 j — 0 


(A- 15) 
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B(s, t)(A(s, t)f) = EE s J (a,bjs l f) (A. 16) 

i—0 j = 0 

Now we consider s l {aib 3 s 3 f) and treat the three cases i > j, i < j and i — j 
separately: First, we suppose i > j and write 

s l {aibjS 3 f) = s 3 {s l - 3 {a t b 3 s 3 /)) 

Using the identity s(aibjs l f) = aibjS l+1 f + ( a,6j + a,ibj)s l f , where l is any integer, 
for i — j times, we can swap the operator s l ~ 3 with a,6j and obtain 

i-j 

s l ~ 3 {a i b j s 3 f) = dibjS l f + ^ s l ~ 3 ~ k (a l b ] + a i 6 j )s J_1+fe / 

k = 1 

and, therefore, 

i-j 

s' l (a i b j s j f) = s j {a i b j s l f) + ^ s l ~ k {aibj + aibj)s j ~ 1+k f, i > j (A.17) 

k = 1 

Similarly, for i < j, we write 

s l (aibjS J f) = s l ((a i b j s 3 ~ l )s l f) 

Now using afijS 1 f = s(a.ibj)s 1 ^ 1 f — ( a.ibj + a»i>j)s z_1 / , where l is any integer, for 
j — i times, we have 

(a i b j s J - l )s l f = s 3 ~ l (a i b j )s l f - ^ s- 7 “^ fe (a i 6 i + a,bj)s l ^ 1+k f 

k = 1 

and, therefore, 

j-i 

s z (a,ibjS 3 f) = s 3 (aibjS 1 f) - ^ s 3 ~ k (a i b j + a i & J )s i_1+fc /, i < j (A.18) 

fc=i 

For i = j, it is obvious that 

s l (aibjS 3 f) = s 3 {a i bjS l f) 1 i = j (A.19) 

Combining (A.17), (A.18) and (A.19), we have 

n m 

A(s,t)(B(s,t)f ) = EE s 3 (a i b j s l f) + n = B(s,t)(A(s,t)f) + n (A. 20) 

i—0 j—0 
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where 

n m i—j n m j—i 

r i = +ai b j)s J ~ 1+k f -'Y^Y^2, s3 ~ k {ai b i + a i 6 i )s* _1+fc / 

0 j = o k = 1 i =0 j=o /c — 1 

j<% j>i 

Note that for 0 < i < n, 0 < j < m and 1 < k < \i — j |, we have 

j < i — k, j + k — 1 < i — 1, if i > j 
i < j — k, i + k — 1 < j — 1, if i < j 

Therefore all the s-terms in r\ ( before and after the term aibj + di.bj ) have order 
less than max{n,?n} — 1, and ry can be expressed as 

n = a?(s)F(a,b)a n (s)f 

where n = max{n, m} — 1 and F{a 1 b) € 77" xn is a time-varying matrix whose 
elements are linear combinations of a ibj + aibj. Because a, b £ £oo, it follows from 
the definition of F(a, b) that 

\\F(a,b)\\ < ci|d| + c 2 |6| 

(ii) Applying Lemma A.l with W(s) = , we have 

gj gj . 

= TTX aib ^ ~ WjcdWjbfliciibj + afij)) 
ilofSj A 0 (SJ 

where Wj c (s),Wjb(s) are strictly proper transfer functions, which have the same 
poles as Ao * s ) . Therefore 


A(s,t) (B(s,t)j^-jf) =J^J^s i (a i b 


i—0 j — 0 
n m 


A 0 (s) ‘ 


= EEuh (ai ^-EE siWjcttWMoibj + cubs)) 

i = 0 j = 0 7 ^ 


*=0 j = 0 


1 


Ao(s) 


A(s,t) ■ B(s,t)f + r 2 


(A.21) 


where 

n m 

r2 = slw jc((W jb f)(aibj + aibj)) 

i = 0 j — 0 

From the definition of r 2 , we can write 

?’ 2 = al(s)G(s,f, a,b) 
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by defining 

m 

G(s,f,a,b) = [g n ,,...,gi,g 0 \, g 3 = -^W jc (s) ((IT j ; ) /)(a i & j 


B Optimization Techniques 

An important part of every adaptive control scheme is the on-line estimator 
or adaptive law used to provide an estimate of the plant or controller param- 
eters at each time t. Most of these adaptive laws are derived by minimizing 
certain cost functions with respect to the estimated parameters. The type of 
the cost function and method of minimization determines the properties of 
the resulting adaptive law as well as the overall performance of the adaptive 
scheme. 

In this section we introduce some simple optimization techniques that 
include the method of steepest descent, referred to as the gradient method, 
Newton’s method and the gradient projection method for constrained mini- 
mization problems. 

B.l Notation and Mathematical Background 

A real-valued function / : IZ n 1 — > 72. is said to be continuously differentiable 
if the partial derivatives d g^ , • • • , ^g^ exist f° r each x £ IZ n and are con- 
tinuous functions of x. In this case, we write / £ C 1 . More generally, we 
write / £ C m if all partial derivatives of order m exist and are continuous 
functions of x. 

If / £ C 1 , the gradient of / at a point x £ 7 Z n is defined to be the 
column vector 

‘ df(x) 
dxi 

v/(x) = : 

om 

- dXn 

If / £ C 2 , the Hessian of / at x is defined to be the symmetric n x n matrix 
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having d 2 f(x) /dxidxj as the ij th element, i.e., 


oxioyj 


A subset S of 7 Z n is said to be convex if for every x,y E S and a £ [0, 1], 
we have ax + (1 — a)y £ S. 

A function / : S i— ► 1Z is said to be convex over the convex set S if for 
every x,y £ S and a £ [0, 1] we have 

f(ax + (1 - a)y) < af(x) + (1 - a)f(y) 

Let / £ C 1 over an open convex set S, then / is convex over S iff 

f{y) > f(x) + {Vf(x)) T (y - x), Vx,y€S (B.l) 

If / £ C 2 over S and V 2 /(x) > 0 V.x £ S , then / is convex over S. 

Let us now consider the following unconstrained minimization problem 

minimize J{6) , , 

subject to 6 £ TZ n 

where J : IZ n ^ IZ is a given function. We say that the vector 8* is a global 
minimum for (B.2) if 

j(6*) < j( 6) ye £ n n 


A necessary and sufficient condition satisfied by the global minimum 8* is 
given by the following lemma. 

Lemma B.l Assume that J £ C 1 and is convex over TZ n . Then 8* is a 
global minimum for (B.2) iff 

VJ{8*) = 0 

The proof of Lemma B.l can be found in [132, 196]. 

A vector 8 is called a regular point of the surface Sg = {8 £ 7 Z n \g(0 ) = 0 } 
if X g(8) / 0. At a regular point 8. the set 

M(8) = [d £ 7^ n 1 8 T Vg(0) = o} 

is called the tangent plane of g at 8. 
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B.2 The Method of Steepest Descent (Gradient Method) 

This is one of the oldest and most widely known methods for solving the 
unconstrained minimization problem (B.2). It is also one of the simplest for 
which a satisfactory analysis exists. More sophisticated methods are often 
motivated by an attempt to modify the basic steepest descent technique for 
better convergence properties [21, 132, 196]. The method of steepest descent 
proceeds from an initial approximation 4 for the minimum 9* to successive 
points 9 - ■ ■ in IZ n in an iterative manner until some stopping condition 
is satisfied. Given the current point 9 *., the point 4+1 is obtained by a linear 
search in the direction 4 where 

4 = -VJ(4) 

It can be shown [196] that 4 is the direction from 4 in which the initial rate 
of decrease of J(0) is the greatest. Therefore, the sequence { 9 4 is defined 
by 

4+i = 4 + Afc4 = 4 - AfcVJ(4), (fc = 0,1,2, •• •) (B.3) 

where 4 is given and Xk , known as the step size or step length , is determined 
by the linear search method, so that 4+ 1 minimizes J(9) in the direction 4 
from 4- A simpler expression for 4+ 1 can be obtained by setting A& = A Vfc, 
i.e., 

4+1 = 4 - AVJ(4) (B.4) 

In this case, the linear search for A& is not required, though the choice of the 
step length A is a compromise between accuracy and efficiency. 

Considering infinitesimally small step lengths, (B.4) can be converted to 
the continuous-time differential equation 

0 = -VJ(0(t)), 9 (t 0 ) = 9 0 (B.5) 

whose solution 9(t) is the descent path in the time domain starting from 
t = t 0 . 

The direction of steepest descent d = — VJ can be scaled by a constant 
positive definite matrix T = T t as follows: We let T = 4r7 where 4 is an 
n x n nonsingular matrix and consider the vector 8 E IZ n given by 


40 = 9 
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Then the minimization problem (B.2) is equivalent to 

minimize J(9) = J(Ti6) /p> 

subject to 9 € lZ n 

If 9* is a minimum of J, the vector 9* = 1^0* is a minimum of J. The 
steepest descent for (B.6) is given by 

h+1 = h~ A V J(0 fe ) (B.7) 

Because VJ(9) = C>J = T^VJ^) and V\9 = 9 it follows from (B.7) that 

9 k+1 = 9 k - \T 1 TjVJ(9 k ) 

Setting T = TiT^, we obtain the scaled version for the steepest descent 
algorithm 

0 k+ i = 0 k - XTVJ(9 k ) (B.8) 

The continuous-time version of (B.8) is now given by 

9 = —PVJ(9) (B.9) 


The convergence properties of (B.3), (B.4), (B.8) for different step lengths 
are given in any standard book on optimization such as [132, 196]. The 
algorithms (B.5), (B.9) for various cost functions J{9 ) are used in Chapters 4 
to 9 where the design and analysis of adaptive laws is considered. 

B.3 Newton’s Method 

Let us consider the minimization problem (B.2) and assume that J(9) is con- 
vex over lZ n . Then according to Lemma B.l, any global minimum 9* should 
satisfy X7J(9*) = 0. Usually V J{9*) = 0 gives a set of nonlinear algebraic 
equations whose solution 9* may be found by solving these equations using 
a series of successive approximations known as the Newton’s method. 

Let 9 k be the estimate of 9* at instant k. Then VJ(0) for 9 close to 9 k 
may be approximated by the linear portion of the Taylor’s series expansion 

VJ(9) ~ VJ(9 k ) + 4 VJ(9 ) | e =e k (9 ~ Ok) 


(B.10) 
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The estimate d^+i of 8* at the k + 1 iteration can now be generated from 
(B.10) by setting its right-hand side equal to zero and solving for 8 = &k+ i, 
i.e., 

8 k+1 = 8 k -H-\8 k )VJ(8 k ), k = 0,1,2,... (B.ll) 

where H{6Q = -ggV J{&k) = V 2 J(0fc) is the Hessian matrix whose inverse is 
assumed to exist at each iteration k. 

A continuous time version of (B.ll) can also be developed by constructing 
a differential equation whose solution 0(t) converges to the root of V J(8) = 0 
as t — » oo. Treating 8{t) as a smooth function of time we have 

|vJ(0(t)) = ^VJ(0)0 = H(8)8 (B.12) 

Choosing 

8 = -(3H~ 1 {8)VJ{8) i 8(t 0 ) = 8 0 (B.13) 

for some scalar f3 > 0, we have 

= -pvj(8) 

or 

VJ(0(t)) = e — 0(*-*°)vJ(0 o) (B.14) 

It is therefore clear that if a solution 8(t) of (B.13) exists Vt > t$, then 
equation (B.14) implies that this solution will converge to a root of VJ = 0 
as t — > oo. 

When the cost J depends explicitly on the time t, that is J = J(8, t ) and 
is convex for each time t, then any global minimum 8* should satisfy 

VJ(0*,i)= 0, Vt > t 0 > 0 
In this case, (B.12) becomes 

j t V J(6, t ) = ^ V J(0, t)0 + J(0, t) 

Therefore, if we choose 

0 = —H~ 1 (9) ^/3VJ(0, t) + ^VJ(0,t)) 


(B.15) 
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for some scalar (3 > 0, where H(9,t) = j^'VJ(9,t) we have 

or 

V J(0(t), t) = e -^- t o)vJ(0(fo), to) (B.16) 

If (B.15) has a solution, then (B.16) implies that such solution will converge 
to a root of V J = 0 as t — > oo. 

Newton’s method is very attractive in terms of its convergence properties, 
but suffers from the drawback of requiring the existence of the inverse of 
the Hessian matrix H(0) at each instant of time (k or t). It has to be 
modified in order to avoid the costly and often impractical evaluation of the 
inverse of H. Various modified Newton’s methods use an approximation 
of the inverse Hessian. The form of the approximation ranges from the 
simplest where H remains fixed throughout the iterative process, to the 
more advanced where improved approximations are obtained at each step 
on the basis of information gathered during the descent process [132], It 
is worth mentioning that in contrast to the steepest descent method, the 
Newton’s method is “scale free” in the sense that it cannot be affected by a 
change in the coordinate system. 

B.4 Gradient Projection Method 

In sections B.2 and B.3, the search for the minimum of the function J(0) 
given in (B.2) was carried out for all 9 G TV . In some cases, 9 is constrained 
to belong to a certain convex set 

S = {9eK n \g(6) <0} (B.17) 

in IZ n where g(-) is a scalar-valued function if there is only one constraint 
and a vector-valued function if there are more than one constraints. In this 
case, the search for the minimum is restricted to the convex set defined by 
(B.17) instead of TV. 

Let us first consider the simple case where we have an equality constraint, 
that is, we 


minimize J(9) 
subject to g(9) = 0 


(B.18) 
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where g(9) is a scalar-valued function. One of the most common techniques 
for handling constraints is to use a descent method in which the direction 
of descent is chosen to reduce the function J(9) by remaining within the 
constrained region. Such method is usually referred to as the gradient 
projection method. 

We start with a point 9q satisfying the constraint, i.e. , g(9o) = 0. To 
obtain an improved vector 9\ . we project the negative gradient of J at 9q 
i.e., — VJ(^o) onto the tangent plane M.(9q) = G IZ n Vg T (#o)# = oj 
obtaining the direction vector Pr(#o)- Then 9\ is taken as 9q + AoPr(0o) 
where Ao is chosen to minimize J{9\). The general form of this iteration is 
given by 

&k + 1 = + ^fcP r ($fc) (B.19) 

where A k is chosen to minimize J(9 k ) and Pr(0^) is the new direction vector 
after projecting —VJ(9 k ) onto M.(9 k ). The explicit expression for Pr(#fc) can 
be obtained as follows: The vector —X7J(9 k ) can be expressed as a linear 
combination of the vector Pr(0^) and the normal vector N(9 k ) = Vg(9k) to 
the tangent plane M.(9Q at 9 i.e., 

-VJ(0 fc ) = a.Vg(9 k ) + Pr (9 k ) (B.20) 

for some constant a. Because Pr(0fc) lies on the tangent plane Jd(9 k ), we 
also have V g T (9 k )Pi'(9 k ) = 0 which together with (B.20) implies that 

— V< 7 T VJ = a'Vg T 'Vg 


a = -(Vg T Vg)- 1 Vg T VJ 
Hence, from (B.20), we obtain 


Pr (0*) = - [/ - Vg(Vg T Vg)- 1 Vg T ] VJ (B.21) 


We refer to Pr(0^) as the projected direction onto the tangent plant M.(9 k ). 
The gradient projection method is illustrated in Figure B.l. 

It is clear from Figure B.l that when g(9) is not a linear function of 9 , the 
new vector 9 k+ \ given by (B.19) may not satisfy the constraint, so it must 
be modified. There are several successive approximation techniques that can 
be employed to move 9 k+ \ from M.(9 k ) to the constraint surface g(9) = 0 
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Vg(6 k ) -VJ(0 k ) M(0 k ) 



[132, 196]. One special case, which is often encountered in adaptive control 
applications, is when 9 is constrained to stay inside a ball with a given center 
and radius, i.e., g(9) = (9 — 9o) T (9 — 9 q) — M 2 where 9$ is a fixed constant 
vector and M > 0 is a scalar. In this case, the discrete projection algorithm 
which guarantees that 9]. £ S Vfc is 


9k + 1 
9k + 1 


9k + A^V J 

I 9 k+ 1 if \9 k+ i - 0 O | <M 

\ «o + i" if I«‘+1 - «°l > M 


(B.22) 


Letting the step length become infinitesimally small, we obtain the 
continuous-time version of (B.19), i.e., 


9 = Pr (9) 


I-Vg(Vg T Vg)- 1 Xg T 


VJ 


(B.23) 


Because of the sufficiently small step length, the trajectory 9(t), if it exists, 
will satisfy g{9(t)) = 0 Vf > 0 provided 0(0) = 9q satisfies g(9o) = 0. 

The scaled version of the gradient projection method can be obtained by 
using the change of coordinates T±9 = 9 where Ti is a nonsingular matrix 
that satisfies T = TiT^ and T is the scaling positive definite constant matrix. 
Following a similar approach as in section B.2, the scaled version of (B.23) 
is given by 


9 = Pr(0) 
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where 


Pr(0) = - I- rVg{Vg T rVg)- 1 Vg 




tvj 


The minimization problem (B.18) can now be extended to 


minimize J(0) 
subject to g(9) < 0 


(B.24) 


(B.25) 


where S = {9 £ IZ n \g(9) < 0} is a convex subset of IZ n . 

The solution to (B.25) follows directly from that of the unconstrained 
problem and (B.18). We start from an initial point 8q £ S. If the current 
point is in the interior of S, defined as 5o = {9 £ 7 Z n \ g(9) < 0}, then the 
unconstrained algorithm is used. If the current point is on the boundary of 
S, defined as S(S) = {9 £ lZ n \ g{9) = 0} and the direction of search given 
by the unconstrained algorithm is pointing away from S, then we use the 
gradient projection algorithm. If the direction of search is pointing inside S 
then we keep the unconstrained algorithm. In view of the above, the solution 
to the constrained optimization problem (B.25) is given by 


9 = 


—VJ(9) if 9 £ 5oor 9 £ 6(S) and 

— V J + y g g Ty y VJ otherwise 


where 0(0) £ S or with the scaling matrix 


- VJ T Vg < 0 

(B.26) 


f — TVJ(0) if 0 £ 5 0 

0 = J or 0 £ 8{S) and - (TVJ) t V 5 < 0 

I -TVJ + r J-^ TVJ otherwise 

V Vg'TVg 

(B.27) 


B.5 Example 

Consider the scalar time varying equation 

y(t) = 8*u(t) (B.28) 

where y , u : IZ + i— > IZ are bounded uniformly continuous functions of time 
and 9* £ IZ is a constant. We would like to obtain an estimate of 0* at 
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each time t from the measurements of i/(t),u(t), 0 < r < t. Let 9(t) be the 
estimate of 6* at time t, then 


y(t) = o(t)u(t) 


is the estimate of y(t) at time t and 


e(t) = y - y = -(9(t) - 9*)u (B.29) 


is the resulting estimation error due to 0(f) ^ 6*. We would like to generate 
a trajectory 0(f) so that e(f) — > 0 and 0(f) — > 9* as f — > oo by minimizing 
certain cost functions of e(f) w.r.t. 0(f). 

Let us first choose the simple quadratic cost 




{9 - 0*) 2 u 2 (t) 
2 


(B.30) 


and minimize it w.r.t. 9. For each given t, J(9,t ) is convex over 7 Z and 
9 = 9* is a minimum for all t > 0. J(9,t ), however, can reach its minimum 
value, i.e. , zero when 9^9*. 

The gradient and Hessian of J are 


VJ(0, t) = {9- 9*)u 2 (t) = -eu (B.31) 

V 2 J{9, t) = H(9, t) = u 2 {t) > 0 (B.32) 

Using the gradient method we have 


9 = yett, 0(0) = 9q 


(B.33) 


for some 7 > 0. Using the Newton’s method, i.e., (B.15), we obtain 


9 



—eu + 



- 4 - [u + 2 u\ 

u z 


(B.34) 


which is valid provided u 2 > 0 and ii exists. 

For the sake of simplicity, let us assume that u 2 > cq > 0 for some con- 
stant Co and u G Dx and analyze (B.33), (B.34) using a Lyapunov function 
approach. 

We first consider (B.33) and choose the function 


V(9) 


02 

27 


(B.35) 
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where 9{t) = 9{t) — 6* is the parameter error. Because 8* is a constant, we 
have 8 = 9 = 7 eu and from (B.29) e = —6u. Therefore, along any solution 
of (B.33) we have 


V = — = Oeu = — d 2 u 2 < —cod 2 < 0 
7 


which implies that the equilibrium 9 e = 0, i.e, 9 e = 9* is e.s. 
Similarly, for (B.34) we choose 


V(9) = 


8 2 u A 

2 


Because u 2 is bounded from above and below by Co > 0, V(8) is positive 
definite, radially unbounded and decrescent. The time derivative V of V(9) 
along the solution of (B.34) is given by 


V = u a 99 + 2 9 2 u 3 ii = u 2 e0u + 2 u 2 e0u + 2 9 2 u 3 u 


Using e = —8u we have 


V = —9 2 u 4 - 2 9 2 u 3 u + 2 8 2 u 3 u = - 8 2 u 4 < -c 2 0 9 2 

which implies that the equilibrium 9 e = 0 i.e., 9 e = 8* of (B.34) is e.s. 

Let us now assume that an upper bound for 9* is known, i.e., |0*| < c for 
some constant c > 0. If instead of (B.30) we consider the minimization of 


J(M 


{ 9-8 


*\2 


-U 


subject to g{9) = 9 2 — c 2 < 0 
the gradient projection algorithm will give us 


f 7 eu if 8 2 < c 2 

8 = < or if 8 2 = c 2 and cad < 0 (B.36) 

0 otherwise 


where 0(0) is chosen so that 0 2 (O) < c 2 . We analyze (B.36) by considering 
the function given by (B.35). Along the trajectory of (B.36), we have 

V = —d 2 u 2 < —2'ycoV{9) if 8 2 < c 2 or if 8 2 = c 2 and cud < 0 
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and 

V = 0 if 6 2 = c 2 and euO > 0. 

Now 6 2 = c 2 and euO > 0 =>• 66* > c 2 . Since |0*| < c and 6 2 = c 2 , the 
inequality 66* > c 2 is not possible which implies that for 6 2 = c 2 ,eu6 < 0 
and no switching takes place in (B.36). Hence, 

V(0) < -2 jc 0 V(6) Vt> 0 

i.e., V(ff) and therefore 6 converges exponentially to zero. 
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Projection, 202, 604, 789 
Proper transfer function, 35 
Pure least-squares, 194 

Radially unbounded function, 110 
Region of attraction, 106 
Relative degree, 35 
Riccati equation, 460 
Richness of signals, 255, 643 
dominant, 643 
sufficient, 255 

Robust adaptive control, 635 
Robust adaptive law, 575 
Robust adaptive observer, 649 
Robust parameter identifier, 644 

Schwartz inequality, 71 
Self-tuning regulator, 16, 435 
Semiglobal stability, 661 
Sensitivity function, 16, 136 
Series-parallel model, 152 
Singular perturbation, 535 


Small gain theorem, 96 
Small in m.s.s., 83 
SPR-Lyapunov design, 170 
Stability, 66, 105 

BIBO stability, 80 
internal stability, 135 
I/O stability, 79 
Stabilizability, 498 
State space representation, 27 
State transition matrix, 28 
Strictly positive real, 126 
Sufficient richness, 255 
Swapping lemma, 775 
Switched excitation, 506 
Sylvester Theorem, 44 
Sylvester matrix, 44 
Sylvester resultant, 45 

Time-varying systems, 734 
Transfer function, 34 
minimum phase, 40 
relative degree, 35 
Tunability, 415 

Uniform stability, 106 
Uniformly complete observability 
(UCO), 90, 221 

Uniformly ultimately bounded, 107 
Unnormalized MRAC, 343 
Unstructured uncertainty, 632 
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